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1.  Summary. 


In  [2]  Ole-Johan  Dahl  and  the  author  studied  an  algorithm  for 
priority  queue  maintenance,  first  used  in  the  work  with  the  language 
SIMULA  in  the  beginning  of  the  1960's.  The  strategy  uses  special  binary 
trees  called  p-trees,  and  algorithms  to  maintain  those  structures. 

The  main  part  of  [2],  as  well  as  of  the  more  detailed  treatment  in 
[1]  and  [3],  was  devoted  to  a mathematical  analysis  of  the  efficiency  of 
the  structure  after  n successive  insertions.  Each  new  key  was  supposed 
to  be  independent  of  the  other  keys  and  to  have  equal  probability  of 
falling  in  any  of  the  intervals  defined  by  those  keys  already  in  the  queue. 

This  paper  is  concerned  with  the  efficiency  of  the  algorithm  after  a 
large  number  of  alternating  remove-best/insert-random  steps,  starting 
with  the  situation  after  n successive  insertions. 

The  famous  ergodic  theorem  of  Markov  chain  theory  ensures  us  that 
there  exists  a stationary  state,  called  the  stationary  p-tree  forest, 
which  the  process  approaches.  We  will  find  approximate  values  for  properties 
of  the  stationary  p-tree  forest,  as  am  application  of  general  methods 
which  will  be  developed  for  the  analysis  of  such  algorithms. 

Let  F denote  the  normal  p-tree  forest  and  S the  stationary 
p-tree  forest.  The  following  table  compares  some  of  the  aspects  of  these 
two  random  structures : 


F 


Expected  left 
path  length 

Expected  insertion 
time 


- 1 


ih2+ 

5 Hn  + 9 


1 

5 


Hn  + 0(1) 


i H2 

3 n 


S 

+ | Hn  + 0(1) 
+ | Hn  + 0(1) 


Expected  recursion 
depth 


2 

5 


Vi  + 


1 
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The  stationary  p-tree  forest  S is  more  "skinny"  than  the  normal 
p-tree  forest  F . Near  the  root,  S is  approximately  equal  to  F ; 
for  example  the  expected  right  path  length  tends  to  the  same  limit,  and 
the  probabilities  of  the  value  of  the  node  next  to  the  root  sure  nearly  the 
same.  However  at  the  end  of  the  left  path  S is  quite  different  from  F . 
The  expected  length  of  the  left  path  of  the  last  right  subtree  of  the 
left  path  is  shown  to  approach  1 , while  the  corresponding  value  of  F 
approaches  ^ . Similarly,  the  probability  for  the  node  next  to  the 
left  leaf  to  be  a , is  shown  to  have  the  approximate  values: 


in  S: 
in  F: 


2 . 1 
3 a-  (ert-1)  3n(n-l) 


a(a*-l)  + n(n-l) 


J 


if  a = 3 


4 < a < n-2 


and  if  a = 2 


S: 


2 2 

3 ” 3 


n 


In  Chapter  2,  more  general  aspects  of  the  queuing  phenomenon  are 
presented.  It  should  be  pointed  out  that  the  text  primarily  deals  with 
the  particular  problem  of  finding  measures  of  the  efficiency  of  the 
stationary  p-tree  fbrest,  despite  the  fact  that  some  of  the  methods  have 
obvious  generalisations. 


In  Chapter  3 is  found  a detailed  definition  of  the  stationary  p-tree 
forest  and  its  prerequisites.  We  also  discuss  a function,  the  characteristic 
left  path  polynomial  attached  to  the  forest,  which  will  be  essentially 
useful  later  in  the  paper.  By  arguments  in  Chapter  3 the  function  is 
defined  for  S . 

In  Chapter  4 one  will  find  a deductive  proof  of  the  probabilities 
for  the  value  of  the  node  next  to  the  left  leaf.  The  derivation  involves 
techniques  from  discrete  mathematics,  especially  involving  binomial 
coefficients. 

In  Chapter  5 we  collect  the  information  to  derive  the  measures 
for  S . 
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2 . Models 


2.1  The  Queuing  Phenomenon. 

In  the  general  case  of  the  queuing  phenomenon  we  have  a Source  (S) 
consisting  of  a number  of  independent  devices,  generating  units  to  be 
served  at  some  Service  Processor  (SP)  . SP  for  some  reason  (for 
example,  its  capacity)  will  not  serve  the  units  at  arrival,  and  therefore  it 
depends  on  some  type  of  Queue  Controller  (QC)  which  arranges  the  units 
in  some  kind  of  priority  sequence  according  to  key  values  assigned  to 
each  unit . qc  usually  makes  use  of  some  predefined  strategy  working 
with  special  types  of  storage  structures  in  the  queue  itself  (e.g.  linear 
lists,  binary  trees,  index  tables) . At  request,  the  QC  releases  the 
unit  having  the  best  key  value,  for  service  by  the  SC  (Best -In-First -Out 
(BIPO)  strategy) . 

The  process  of  placing  a new  unit  in  the  queue  is  called  an  Insertion  (I) 
and  the  process  of  taking  the  best  unit  out  of  the  queue  is  called  a Remove  (R)  . 


S 

QS 

QC 

SP 


Source 

Queue  Storage 
Queue  Controller 
Service  Processor 

Insertion 

Remove 


Figure  1. 
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We  shall  deal  only  with  the  queuing  process  and  will  assume  that 
the  units  consist  of  the  key  value  only. 

A very  simple  way  of  assigning  key  values  is  to  define  some  kind  of 
time  function  according  to  the  arrival  at  the  QC  . The  best  strategy 
is  then  probably  to  use  a simple  linear  list  in  the  QS  . However,  in 
the  general  case  keys  emanate  from  the  source  with  values  according  to 
some  distribution  function;  they  may  be  adjusted  by  the  QC  prior  to 
insertion  and  even  be  changed  during  their  stay  in  the  QS  . We  will 
use  the  term  key  pattern  for  the  complex  of  rules  according  to  which 
keys  are  assigned. 

The  queuing  process  may  be  regarded  as  a discrete  time  sequence  of 
events.  At  each  time  t (t  *1,2,3)...)  either  an  insertion  or  a 
removal  takes  place.  In  general  we  may  have  a case  where  the  event  to 
take  place  is  subject  to  selection  according  to  some  distribution  function. 

We  will  use  the  term  i/R-pattern  for  the  complex  of  rules  according  to 
which  the  insert/remove  sequence  takes  place. 

Maintaining  a priority  queue  requires  selection  of  a strategy  for 
the  structural  ordering  of  the  keys  and  algorithms  for  insertion  and 
removal  of  keys.  Linear  lists,  AVL-trees,  and  "heaps"  are  examples  of 
such  strategies.  Each  strategy  provides  algorithms  for  insertion  and  removal, 
as  well  as  a mechanism  for  representation  of  the  data,  and  we  shall  call  it 
the  queue  strategy. 

The  purpose  of  this  paper  is  to  study  a specific  combination  of  the 
three  elements  in  the  queuing  phenomenon,  as  described  in  the  next 
sections.  Some  of  our  methods  and  resultB  have  obvious  generalizations; 
however,  we  shall  not  attempt  such  generalizations  in  this  paper,  but 
concentrate  on  obtaining  results  for  our  special  case. 
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2.2  Models  for  Key  and  i/R-patterns . 

We  will  assume  that  our  source  generates  keys  as  an  infinite  sequence 
of  real  numbers 


X1'V 


•'Xs' ' 


being  independent  random  variables  chosen  according  to  the  exponential 
distribution  with  mean  \ (0  < x)  , having  the  density  distribution 

function: 


(2.2.1)  f(x)  = 


if  0 < x 
otherwise. 


Furthermore,  we  will  adopt  the  following  assumption 


(2.2.2) 


Upon  entry  to  the  queue  controller,  each  new  key  is  increased 
by  the  value  of  the  key  last  removed  frcm  the  queue. 


To  demonstrate  the  effect  of  (2.2.2)  we  give  an  example. 


Example  2.2.1. 

Let  the  first  five  keys  from  the  source  be 
0.8  , 1.9  , 1.1  , 0.1  , 2.0 
and  suppose  the  i/R-pattem  is 
IIIRIRRI 


Time 

I/R 

Key  frcm 
source 

Key  to 
the  queue 

The  keys  in  the  queue 

Last  key 
removed 

1 

I 

0.8 

0.8 

0.8 

0.0 

2 

I 

1-9 

1-9 

0.8  , 1-9 

0.0 

5 

I 

1.1 

1.1 

0.8  , 1.9  , 1.1 

0.0 

k 

R 

1.9  , 1.1 

0.8 

5 

I 

0.1 

0.9 

1.9  , 1.1  , 0.9 

0.8 

6 

R 

1.9  , 1.1 

0.9 

7 

R 

1.9 

1.1 

8 

I 

2.0 

3.1 

1-9  , 3-1 

* 
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Restricting  ourselves  to  a source  generating  keys  which  are  independent 
exponential  random  variables  is  not  uncommon.  Biasing 

the  keys  as  described  in  (2.2.2)  needs  some  motivation.  If  no  adjustment 
were  made  we  would  run  into  cases  where  we  would  have  smaller  keys  in  the 
queue  than  some  of  those  removed  on  earlier  stages.  Not  biasing  keys 
also  means  that  large  keys  will  have  a tendency  to  be  trapped  in  the 
queue,  because  smaller  keys  keep  coming  in  with  non-vanishing  probability. 
The  example  below,  quoted  from  [2],  gives  a practical  example  of  bias 
occurrence: 


Example  2.2.2. 


Let  the  source  contain  n (n  > 1)  independent  exponentially 
distributed  event  patterns,  with  common  parameter  X > 0 . 


The  n devices  each  deliver  am  event  time  X^  (j  = 1,2, ...,n)  to  an 
initial  queue.  From  that  time  on  the  best  key,  say  > is  executed  and 
the  device  k delivers  a new  key 

*k  - VE 

where  E is  exponentieQly  distributed.  Since  X^  is  the  smallest  of  such 
keys  in  the  queue  at  the  present  time,  we  have  a situation  conforming 
with  (2.2.1)  and  (2.2.2).  □ 


Suppose  A is  some  fixed  (i.e.,  not  subject  to  probabilistic 
changes)  i/R  pattern,  and  let  a(t)  (1  < t)  denote  the  t-th  event 
(i  or  R)  . (in  Chapter  3 we  will  concentrate  on  a few  such  a , 
at  present  it  is  left  unspecified. ) 

If  we  at  any  time  t are  left  with  an  empty  queue  (i.e.,  if  the 
number  of  I 's  having  occurred  is  equal  to  the  number  of  R 's  having 
occurred  up  to  and  including  time  t ),  we  clearly  are  in  a trivial 
situation  equivalent  to  the  original  state;  previous  counts  have  no 
effect  on  the  subsequent  ones.  Thus  we  may  neglect  this  situation. 

We  will  allow  A to  be  infinite,  but  will  assume  that  it  is  bounded 
in  the  sense  that  the  queue  never  will  contain  a number  of  keys  larger 
than  some  predetermined  number  M . 

The  latter  two  assumptions  may  be  formulated  as  follows. 


(2.2.3) 


Let  N^(t)  be  the  difference  between  the  number  of  I 's 
and  the  number  of  R ' s having  occurred  in  A up  to  and 
including  time  t . Then 

0 < N^(t)  < M 


for  all  times  t = 1,2,...  , where  M is  some  predetermined 
number. 


Kq  and  A together  uniquely  define  the  queue  at  all  times  t = 1,2,...  , 
when  the  initial  stage  (t  = 0)  is  defined  by  the  empty  queue.  The 
content  of  the  queue  will  be  denoted  as  follows. 


nt  *A(t)) 

(t)  (t) 

the  keys  in  the  queue  at  time  t , in  sequence 
according  to  their  arrival  in  the  queue 


the  value  of  the  key  last  removed  from  the  queue. 
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The  notations  apply  to  the  situation  after  execution  at  time  t (&(t))  . 
Initially 

n0  = 50  * 0 * 

Our  combination  of  and  A have  the  nice  property  of  leaving 

invariant  the  simultaneous  density  distribution  function  for  the  differences 
between  the  keys  and  the  value  of  the  last  removed  key,  as  stated  in 
the  following  proposition. 


Proposition  2.2.1.  Using  the  notations  above,  let  1 < t and  define 
the  stochastic  variables: 

Wj^  = Xj^  " \ 1 < j < nt  = n . 

Then  the  W *s  have  the  following  simultaneous  density  distribution  function: 


,ne"UW1+  W2  r * • ’ + Wn}  if  0 < w1,w2,  . . .,wn 


(2.2.U)  f (w^, Wg, . . . ,w^) 


otherwise. 


Proof.  The  proof  follows  from  standard  results  and  methods  of 
probability  theory. 

As  A(l)  = I , and  the  first  X from  the  source  is  exponent ially 
distributed,  we  have  n1  = 1 , 6 = 0.0  and  the  correct  distribution 

function.  So  the  proposition  is  true  for  t = 1 . 

Assume  the  proposition  to  be  true  for  some  t , 1 < t . 

If  &(t+l)  ■ I , let  the  new  key  from  the  source  be 
X *=  W ♦ ftt 

where  the  density  function  of  W is  given  by  (2.2.1).  At  time  t+1  we 
will  have: 
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”wi  ■ V1 


- "t 

and  the  queue  sequence: 

, (t+1)  (t+1) 

y Aq  j 


-<1>> 

t+i 


- y(^)  y\ 

- ^A^  , A.g  y\) 


The  W 's  at  time  t+1  are  therefore  defined  by: 


■r  - v 


j = 1>2,  . . .^n^^-l 


W 


(t+1) 


nt+l  = W ‘ 

As  W is  independent  of  . . .,W^+^  we  obviously  have  the  required 

nt 

simultaneous  density  distribution  function  at  time  t+1  . 

If  &(t+l)  = R , let 


and 


V(t)  = min(W^,W^,...,W^) 


Y^t^,Ygt\  . . .,Y^i  be  the  remaining  's,  conserving 

the  sequence. 


By  symmetry,  the  simultaneous  density  distribution  function 


for  V^^Y^,  ...jY^.  is: 
•L  nt-u 


^(vjy^yg,  ...,yn  _x) 


V e 


nt  -\(v+y1+  ...  + ynfc.1) 


if  0 < v < yi,  ...,y  x 

X 


otherwise. 


Removing  the  smallest  of  the  's  is  equivalent  to  removing  the  smallest 

of  the  's,  leaving  us  with  the  following  situation: 
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w 


i 


nt+l  * nt_1  » *t+l  " v 


„<->  . y'1)  - v<4) 


j a 1;  2^  • • • f • 


The  simultaneous  density  distribution  function  for  the  v/t+1^  * s is  hence: 

» nt  -X(ntv+(w1+...+w  ) 

, ...,w  ) * f n \ e x dv 

1 d °t+l  0 Z 


when  0 < w. , w„, . . . , w (0  otherwise)  because 

” * nt+l 


v(t)  + Y(t)  + 


+ + (w{t+1)  + ...  +w(t+1)  +V^h 


- nt  V^^  + (w{t+1^  + ... +W^t+1^)  . 

w j-  n. . . 


Simple  integration  yields  the  desired  density  function. 

Proposition  2.2.1  has  now  been  proved  by  induction.  □ 

Another  useful  property  of  our  (Kq, l)  complex  is  the  fact  that  a key 
to  be  inserted  has  equal  probability  of  falling  into  any  of  the  intervals 
defined  by  the  keys  already  in  the  queue,  as  is  readily  seen  from  the 
synmetry  properties  of  the  density  distribution  function  of  Proposition  2.2.1: 

Proposition  2.2.2.  Using  the  notations  above,  assume 
h(t+l)  - I . 

Let  X ■ W^.+6^.  be  the  key  to  be  inserted,  W being  distributed  according 
to  (2.2.1). 

Let  Z^Z^,...,!^  be  the  ordering  variables  of  X^X^,  ...,3^ 

t t 


Prob(X  < z[tJ)  - Prob(zjt)  < X < zj*])  - Prob(Z^t)  < X)  - . 

t t 
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The  results  In  Propositions  2.2.1  and  2.2.2  enable  us  to  replace  the 
continuous  key  pattern  Kq  by  a discrete  key  pattern  DQ  , described 
below . The  replacement  is  easily  seen  to  carry  no  loss  of  generality, 
for  queue  strategies  that  depend  only  on  the  relative  order  of  keys. 

The  key  pattern  DQ  involves  renumbering  of  the  key  values  in  the 
queue  at  each  step.  However  this  will  not  alter  the  internal  arrangement 
of  the  key 

equivalent  to  those  of  Kq  . 


Key  pattern  Dr 


— At  the  end  of  each  time  t the  queue  contains  a permutation 
of  the  integers  1,2,  ...,nt  . 

— If  A(t+1)  = I , the  source  generates  an  X from  the  set 


(2.2.5) 


* I ' ••• » nt + 1 } 


with  discrete  probability  distribution 


Prob(X  = x) 


1 

\+1 


Vx  e T 


Having  inserted  x in  the  queue  the  keys  are  renumbered 
according  to  their  size. 

— If  A(t)  = R , the  key  1 is  removed  and  the  remaining  key 
values  are  decreased  by  1 . 


Note  that  in  DQ  (as  in  Kq  ) all  permutations  (all  relative 
orderings)  are  equally  likely  to  occur,  and  that  inserted  X ' s (both  in 
Dq  and  Kq  ) have  the  same  probability  of  falling  in  any  of  the  n^+1 
intervals  defined  by  the  queue  keys. 


ABHHR 


2.3  The  Queue  Strategy:  p-trees. 

The  queue  strategy  p studied  in  this  paper  is  the  use  of  p-trees 
with  algorithms  for  insert  and  remove,  as  described  in  [1] 
and  [2].  In  these  papers,  as  well  as  in  [3]  and  [6],  one  will  find 
theoretical  and  practical  results  concerning  p . We  will  assume 


familiarity  with  p . 

Using  P , the  queue  structures  are  postfix  ordered  binary  trees, 
being  elements  of  a subset  of  the  set  of  all  binary  trees.  We  will  denote  by 


B^  the  set  of  all  binary  trees  with  n nodes  (n  > 1) 

' the  set  of  all  p-trees. 

(We  recall  that  a tree  TeB^  is  a p-tree  if  and  only  if 
each  node  having  a right  successor  also  has  a left  successor.) 

We  will  agree  to  define  B^  and  to  consist  of  one  tree,  viz. 

the  empty  tree  u>  • 

When  using  p-trees  we  will  adopt  some  conventional  notations. 


(2.3.1) 


Let  T e 3^  (2  < n)  . 

— The  length  of  the  left  path  will  be  denoted  t . 

— The  values  of  the  left  path  nodes  in  postfix  order,  from 
top  to  bottom,  will  be  denoted  by 

n*ql>q2>...>qT=1 


--  The  right  subtrees  of  the  t-1  first  left  path  nodes  (left 
leaf  excluded)  will  be  denoted  by 

agreeing  that  node  values  cure  adjusted  to  range  from  1 
upwards  (if  nonempty). 


I 

I 
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A p-tree  forest  F is  defined  as  the  pair  of  items 


where  $ is  some  probability  model  containing  for  each  tree  T 


a probability  PT  to  occur 


Using  the  key  pattern  DQ  , some  i/R-pattern  A and  p , will  at 
each  time  t leave  us  with  a p-tree  forest,  denoted  by 


being  the  pair 


II... I is  the  i/R-pattem  consisting  of  n successive  insertions 


where 


One  of  the  Important  properties  of  the  normal  p-tree  forest,  due  to  the 
recursiveness  of  the  insertion  algorithm  is  that: 


(2.3.3) 


The  set  of  all  right  subtrees  of  a fixed  left  path  node 
position  is  composed  of  a set  of  copies  of  normal  p-tree  forests. 


This  property  is  called  the  basic  p-tree  property  (BPP).  Formally,  the 
BPP  may  be  described  as  follows: 


Let  F = (y'n  , 4)  be  any  p-tree  forest  (n  > 2)  with 
4 = {PT  | T e 5(n)  } . 

Let  1 < j < (n-1)  be  any  fixed  left  path  node  position,  and 


define 


°id)  = % PT 


Vu 


for  all  trees  U : 


U e U T 
s=0 


(2.3.M 


= £ oSP 


(0  < k < n-j-1) 


*kJ)  ■ I’u0  - ^5T  U£,(k)j  • 

Then  F is  said  to  have  the  BPP  if  the  forests 

are  normal  p-tree  forests  for  all  j , k : 0 < k < n-j-1  ; 
1 < j < n-1  . 


A proof  of  the  fact  that  Fv  have  the  BPP  is  found  in  [1]. 

0 


WE 


■tmur  »■ 


2 -4  The  Characteristic  Left  Path  Polynomial. 

Adopt  the  notations  in  the  previous  section  and  let  T e , 
(n  > 2)  . The  polynomial 


(2.4.1) 


t-1  q q 

h_(z,v)  = E z ^ w ^+1 

3 =1 


is  called  T's  left  path  polynomial  (LPP). 


Let  F = (y^n  ,f)  be  any  p-tree  forest  (n  > 2)  . The  polynomial 


(2.4.2) 


Hp(z,w)  = E PT  hT(z,w) 
T e 


is  called  F’s  characteristic  left  path  polynomial  (CLPP). 

Being  a polynomial  in  z and  w having  terms  of  the  type  zaw^  with 
1 < b < a < n we  see  that  we  may  write 


(2.4.3)  Hf(z,w) 


_(F)  a b 

L Pa  b 2 w 
1 <b  <a <n  a,t> 


where 


(2.4.4) 


the  probability  of  a tree  in  F to  have  a and 
b as  values  of  adjacent  nodes  on  the  left  path 


For  convenience  we  will  adopt  the  conventions 


(2.4.5) 


^(zjw)  - Hp,( ZfV)  =o  if  n = 1 

h^Zjw)  = Hp(z,w)  - -z  if  n « 0 . 


From  the  CLPP  of  a p-tree  forest  F , we  may  deduce  the  expected  left 
path  length  Iy  . Because  each  tree  T has  exactly  one  more  node  on  its 
left  path  than  the  number  of  terns  in  its  LPP  we  find 


T = 1+  1^1,1) 

leading  to 

(2.4.6)  Lp  - 1 + Hf(1,1)  (0  < n)  . 

Assume  that  F - (y^,j)  (n  > 2)  has  the  BPP,  defined  in  the  previous 

section.  We  may  then  use  F ' s CLPP  to  establish  the  expected  number  of  key 
comparisons  (SF)  necessary  to  Insert  a random  x e , being  subject  to  the 
equiprobability  distribution  as  in  DQ  in  the  trees. 

We  split  Sp  into  two  parts: 

(2.4.7)  Sp  = SLp  + SRj, 

where  SL  is  the  expected  number  of  key  comparisons  involving  left  path 
r 

nodes,  and  SRp  is  the  expected  number  of  key  comparisons  involving  nodes 
in  the  right  subtrees. 


SL_. 

F 


Let  T be  any  tree,  and  use  notations  as  in  (2.5.1): 
if  x < 1 we  use  T comparisons; 

if  < x < for  some  J = 1,2,  ...,-r-l  we  use  j+1 

comparisons ; 

if  n < x we  use  1 comparison; 
leading  to  the  expected  number  of  left  path  comparisons  in  T : 

, T-l 


-SI 
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Now 


T-i  f , T-l  q q 

Z ,,  - £ Z Z J W ■*  1 

j=i  J dz  j =1  J z =w  = 1 


so  that 


SIT 


. + _i.  r avz,ir)i 

»*L  Jz=w  = l 


and  obviously 


(2.U.8) 


SLp  = 1 


± dHF(z,w) 

+ n+I  dz  ", 

L.  J z = w = J. 


SRy- 

Let  the  number  of  key  comparisons  necessary  to  insert  x in  the  right 
subtree  of  left  path  node  j of  the  tree  T be 

sT(x>J) 

provided  q^+1  < x < q^  . This  process  is  clearly  equivalent  with 

inserting  x-q.  in  B.  (where  node  values  have  been  adjusted),  denoted 

J 

by  8_  (x)  , because  of  the  BPP. 

Bj 


We  then  find 

SR„  «=  £ 


F Z PT  SI  *!<*’« 

T e 5 " J 1 XeTn 


Vi<x<q 


j 


n-1  n-j-1 

z t z 

).l  k.o  Ut,00 


Z.  n+1  Vx> 
x € Tv 


E PT 

(n)  1 


Te? 

VU 
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Using  the  notations  of  (2.3.4),  knowing  that  F has  the  BPP,  we  find 


and  hence 


where 


T e 

Vu 


PT  * 


„(3>  A)) 


SR. 


n-1  n-j-1  . , ..  , ,N 

- s £ si43)v, , 

j =1  k = 0 I^1  * 


J.k 


UeJ 


W 


x?rk  ^ ^ & ' %<» 


because  (?^,<pj^)  is  a normal  p-tree  forest. 

is  the  probability  of  finding  a tree  in  F with  right 

q1  qi+l 

subtree  j of  size  k . Each  time  a term  z d w J with 

q j -q^+1-l  = k occurs  in  the  LFP’ s of  the  trees  in  F we  get  a 
contribution  PT  to  the  corresponding  term  in  F ' s CLPP.  Summing  over  all 
possible  j ' s will  correspond  in  the  CLPP  to  summing  the  coefficients  of  all 
with  a-b-1  = k . Hence 
n-k-1 

i 

b 


o 

possible  z w 


n-k-1  , 

E a£J) 
J *1  * 


E s.(F) 

, n-H-kHjb 
) = 1 ' 


and 

(2.U.9) 

n-2 

SR,  = E 

k =0 

a n v"1  o(F) 

nfl  F<k)  b = 1 Wl»b 

Bringing 

(2.4.8)  and  (2.4.9) 

together  we  find 

(2.4.10) 

S_  - 1 + — 
F n+1 

faH(z,w)l  n-2 

-V-  * s 

L °z  Jz=w  = l k =0 

where 

k+1 


(n  > 2) 
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(2.4.11) 


11 T1  *<F> 

Pk  " b-1  ^flCfl>b  ' 


The  p ’ s may  be  found  as  follows : 

A 


<2A-12)  “f  V - I *f(  8 ■ §)  • 


The  quantities  S ^ are  known  from  [1]  and  [2]: 


(2.4.13) 


_ 1 h2  + 10  _ 1 (2)  . 28 
,(n)  " 3 Hm-1  9 itf-l  3 HnH  27 


,(0)  = 0 5 S (1)  = 1 

0 Fo 


(n  >2) 


Similar  to  the  methods  used  above  for  L and  S we  may  establish 

F F 

formulae  for  the  quantities 


*F 

HL, 


the  expected  length  of  the  right  path  in  F 

the  expected  length  of  the  left  path  of  the  last  right 
subtree 


Cj,  the  expected  recursion  depth 

all  quantities  being  examined  in  [2].  The  appropriate  formulae  turn  out 
to  be 


(2.4.14) 


(2.4.15) 


n-2 


rf  ■ 1 * A *oo  Ci-k-1 


k *0  F' 


n-2 


L-(*)  P^2,l 


k *0  F, 


(n  >2) 


(n  > 2) 
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(2.4.16) 


n-2 


cr  1 + 


k =0 


(n  > 2) 


(with  defined  in  (2.4.11)). 

We  shall  demonstrate  the  effects  of  formulae  (2.4.6),  (2.4.10), 
(2.4.14),  (2.4.15)  and  (2.4.16)  when  applied  to  the  normal  p-tree  forests. 


We  assume  n > 2 and  F. 

,(n) 


(n) 


Fq‘v  has  the  BPP  and  the  CLPP: 


(2.4.17) 


n a-1 


H (n)(z’w)  “ Z 
Fq'  a = 2 b 


^(’(n.l-bWb)  * ' 


The  latter  formula  was  established  in  [2]  on  basis  of  considerations  on 
the  correspondence  between  the  set  of  all  permuations  of  the  numbers 
1, 2, ...,n  and  F^n^  . 


From  (2.4.17)  we  deduce 


n a-1 


(2.U.18)  hf(„)  f1*1)  * ^ (75T-b)(„-bJ  * ) 


• SWn'1) 


and  (according  to  (2.4.12)): 


(2.4.19) 


n-2 

E 

k =0 


1 n a— 1 , _ _ . 

_k  ^ f 1 1 ^ a-b-1 

k ' a. 2 b,J1(<wl-bK"-b)  ) 2 


I£2  2(n-k-l)  k 

k?c^^Z 


and  finally 
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SM'iHtliA- 


(2.4.20) 


dz 


z = w = 1 a 


bfi((Rfl-b)(n-b)  + T^)a 


« (r*l)Hn  - -^1  . 


Inserting  (2.4.18)  - (2.4.20)  in  (2  A. 6),  (2.4.10)  and  (2.4.14)  - (2.4.16) 
we  find  (n  > 2)  : 


(2.4.21) 


» 


,(n) 


2H  -1 
n 


H»  * ^ °f0  <”-k-1)s 


,(k) 


n^2  ^ x 

%<“>  ' 1 \,0  \w  ' l n^n^T  * Jknnm 


rrm) 


RL 


(n) 


n-2 
L L 
k = 0 F, 


(k) 


C n(n-l)  + (ktl) (kf  2)  ) ~ 2 " n + n(n-l) 


n-2 


,(n)  = 1 + (n(n+l) ) ^ (n-k-1)CF(k)  * 


These  formulae  confirm  those  of  [2],  Detailed  treatment  of  (2.4.21), 
may  be  found  in  [1]  and  [5]. 

The  main  advantage  obtained  by  use  of  the  CLPP  relative  to  [1]  is  the 


establishment  of  the  term  SL  ^ from  (2.4.8). 
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5.  The  Stationary  p-tree  Forest. 


5.1  General  Considerations. 


Consider  the  key  pattern  DQ  (see  (2.2.5)),  the  queue  strategy  p 
and  any  p-tree  forest  F = where  n > 1 and 


(3.1.1) 


$ = (<p(T)  | T e ?(n)} 


Let  A be  the  i/R  pattern  consisting  of  an  infinite  number  of  alternating 
insertions  and  removals: 


(5.1.2) 


A(2s-l)  = I , A(2s)  = R 


s =1,2,  ... 


Suppose  we  start  at  time  0 with  F and  apply  A • At  each  time  t = 2s  , 


s - 1,2, 


we  are  left  with  a p-tree  forest,  denoted  by 


(3.1.3) 


>(s)  = 0?(n),*(s)) 


= [<P^(T)  | T e 7^} 


We  also  define  Fv  ' = F . 

The  sequence  F^,F^,  . ..,F^,  . ..  may  be  regarded  as  an  infinite 
Markov  chain,  where  the  possible  stages  are  the  trees  of  7^  and  where 
the  transition  matrix 

n = (mi, 

is  an  NxN  matrix  (N  being  the  number  of  elements  in  7'n^  ) whose 
elements  m,  . are  the  probabilities  of  mapping  tree  i from  7^  to 

±9  J 

f n) 

tree  j in  7^  ' in  one  complete  insert-remove  operation.  (We 
fix  some  numbering  of  the  trees  in  7^  .) 

To  demonstrate  this  transition,  let  us  consider  the  case  when  n = 4 . 
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Example  *.1.1.  The  left  column  in  the  table  below  contains  the  possible  i ’-ee 


(U) 

in  S'  , the  horizontal  line  contains  the  possible  x 's  and  the  table 
entries  are  the  resulting  trees  when  inserting  x and  removing  the  left 


The  transition  matrix  is  therefore 


2k 


and  one  complete  insert -remove  step  may  be  described  as 


f(wi)  . *<■).„ 


where 


is  the  row  vector 


(«P(S)(T1)  , <P(S)(T2)  , <P(S)(T3)  , <P(S)(TU))  . 

In  the  general  case,  if  we  agree  to  denote 

P(s)  = (cp^CT^,^8^),  ...,<P(S)(TN)) 
for  some  predetermined  enumeration  of  the  N trees  in  , we  have 


(3.ifc)  p^1)  = in  (s  >o) 

and 

(3.15)  P(S)  = P(0)  (s  >0)  . 

7H  is  a sparse  matrix,  the  number  of  positive  elements  in  each  row  being 
at  most  n+1  , while  N is  very  large  (consult  [2]).  However  it  is  easy 
to  see  that 


(3.16)  wf1  is  a positive  matrix. 

This  is  deduced  from  the  fact  that  gives  a positive  probability  of 

reaching  any  tree  in  n steps,  regardless  of  what  the  original  tree 

Tq  was. 

To  see  this,  we  refer  to  [1]  where  it  is  shown  that  any  p-tree  may 
be  created  by  selecting  an  appropriate  permutation  of  the  numbers  1,2, ...,n 
and  then  performing  n successive  insertions  using  p.  (Conversely,  picking 
any  permutation,  performing  n successive  insertions  using  p , of  course 
gives  us  a p-tree.)  Let  therefore  (a^,a2, . . .,an)  be  a permutation  of 
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of  the  numbers  1,2, ...,n  corresponding  to  the  tree  . Select  the 

x 's  to  be  inserted:  xi>x2*  * ’ *,xn  irrto  T0  121  such  a manner 

any  stage  the  inserted  nodes  are  larger  than  those  from  TQ  , maintaining 
the  order  according  to  the  permutation  (a^, a^, . . .,an)  . The  tree  T^ 
will  then  gradually  be  built  in  the  upper  part  of  the  tree  while  the 
original  nodes  will  be  removed  one  by  one. 

Since  flf1  is  a positive  matrix,  is  a regular  matrix  in  the  terminology 
of  Markov  chain  theory  (see  for  example  [4]).  The  famous  ergodic  theorem 
of  Markov  chain  theory  then  gives  the  following  statements: 


(3.1.6) 


There  exists  a uniquely  defined  p-tree  forest 
S = (?^,Y) 

with 

Y = (y(T)  | T e 3^) 

( s}  V 

such  that  Fv  / S in  the  sense  that 

S -*a> 

lim  £ |q>^(T)  - f(T)|  = 0 . 

8—  Tc3(n) 


The  probability  vector  P of  Y is  defined  by 
p = p m , If  ■=  1 . 

S is  independent  of  F . 


Example  3.1*2.  To  find  S for  n = we  have  to  solve  the  equations: 


P * — P + — P 

*1  5 15  2 


* K 


5 pi  * I p2  * I p5  * 5 pii 
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t 


^ 1 t,  ,2 

5 P1  5 P2  5 P3 


2P  + iP 

5 P3  5 4 


1 = P1  + P2  + P5  + P4  ‘ 

The  first  four  of  these  equations  have  a determinant  equal  to  0 , as  the 
column  sums  in  77!  are  all  1 . We  find,  for  example, 

P1  ♦ P2  * p5  * PU  - 1 


- - P + — P 

5 1 5 2 


+ \ P. 
5 4 


1 4 1 1 

5P1+  5P2+  5P3+  5P4 


iP  +iP  -Ip 
5 1 5 P2  5 P3 


giving  us  S for  n = 4 


b '-1 


S depends  only  on  Jl | , defined  by  A , and  it  is  therefore  characterized 
by  A , Dq  , P and  the  number  n . We  will  call  S the  stationary 
p-tree  forest  (of  degree  n ) . 


Since  starting  with  F = FQ  , the  normal  p-tree  forest,  will 
maintain  the  BPP  (basic  p-tree  property  (see  (2.3.3)  and  (2.3.4))  for 
all  Fv  ' , it  is  easy  to  see  that 

S = lim  F^ 

S — co 

must  have  the  BPP: 

(3.1-9)  The  stationary  p-tree  forests  have  the  basic  p-tree  property. 

We  also  see  that  the  characteristic  left  path  polynomials  (CLPP)  of  the 

(s') 

F ’s  must  approach  the  CLPP  of  the  stationary  forest,  in  the  sense 
that  each  coefficient  in  the  CLPP  of  the  ’s  approaches  the  corresponding 

coefficient  in  the  CLPP  of  S . We  shall  later  on  establish  a transition 
matrix,  corresponding  to  % for  the  coefficients  of  the  CLPP’s. 


3.2  Defining  . 

From  now  on  we  will  assume  that  n > 2 . We  shall  be  interested 
only  in  the  process  of  alternating  insert/reraove  so  we  define: 

1 

A.  (2t-l)  = I t = 1,2,  ... 

(5.2.1)  1 

A^^t)  * R t = 1,2,...  . 

using  Dq  , p and  the  initial  forest  F^n^  (the  normal  p-tree  forest) 


(3-2-2) 


„ will  be  denoted 

(t  = 0,1>2»  • • *) 

*_ee  forest  at  time  » ^ 

The  p-tree  ioi 

F(t)  . t,(“> , *(t)  i 
I o+a- i (t  - 0,1»2»  -••) 

and  at  time  2t*l  ^ 


t the  reMit.  of  toe  previous  ~«“ 

AcooriiM  ° tree  forest,  having 

I F(t)  fend,  to  a U-*.  «* 

1 „ „ eianents,  denoted  W 

(5.2.5)  I the  BPP,  “i 

1 sin)  * 

1 1 (1)  G(2),...  also  approaches 

Itl.  potato  s«t^--^J  ^ , denoted  here 
a limit,  vif  that  ohtaine 

- \ n+.Vi  -• 


j,  — 

h,  T^1’  • . „ th.  average  left  path  length, 

Ut«^“*'mMU,t  “ ,W  , etc  Using  the  methods 

- vey  comparisons  to  insert  * 1 «ill  denote 

number  of  hey  s Qf  the  forests 

, . n 2 U we  will  need  the  CL 
of  section 

e ty: 

CLPP  of  *i 

V z&  ** 

(3.2.M  Hl^ (Z,W)  l<b<a<n 


f by: 

CLPP  of 


(3-2-5) 


(t)  ,a 

(t)  - * , Pa,b  ‘ 

(z»w)  i<b  <a<n*l 
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(3.2.6) 


CLPP  of  S[n)  by: 
A[n^(z,w) 


1 <b  <a  <n 


(n)  a b 

' ' *7  W 

b 


' z w 


and  finally  the  CLPP  of  by: 


(3.2.7) 


B^nfl^(z,w)  = 


1 <b  <a  <rH-l 


(nfl)  a b 
<b  2 W 


The  interpretation  of  the  Qt's,  p's,  T| ' s and  ' s (see  2A.4),  together 
with  the  statement  (3.1.6),  makes  it  easy  to  see  that 


(3.2.8) 

and 

(3.2.9) 


lim  H^(z,v)  = A^n^(z,w) 

t -«® 


lim  I^(z,w)  = Bjnfl^(z,w) 

t 


In  the  next  section  we  will  establish  relations  between  these  CLPP. 

3.3  Relations  Between  the  CLPP's. 

Suppose  we  have  any  p-tree  forest  X with  the  BPP  and  the  CLPP: 


H(z,w) 


1 <b  <a  <n 


a b 
r . z w 
a,  b 


(X  having  n nodes) . 

We  will  establish  the  CLPP's  of  three  p-tree  forests  X1  , Xg  and 
Xz  as  functions  of  H(z,w)  : 

* * . 

--  x^  is  the  result  of  one  single  insertion  in  X . X.^  has 

ntl  nodes. 
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* #- 
X0  is  the  result  of  one  single  removal  in  X . X^  has  n-1 

nodes . 

# 

— X^  is  the  result  of  the  combination  of  an  insertion  and  a 
removal  in  succession. 

In  terms  of  the  notations  in  the  previous  section,  we  then  will  have: 


if 

X 

is 

F(t> 

(t  > 0)  , 

then 

X*  1.  0(t)  and  x;  la  F<M1> 

if 

X 

is 

«<*> 

then  X* 

is 

P(W1) 

if 

X 

is 

s<“) 

* 

, then  X^ 

is 

and  X*  la 

if 

X 

is 

T(n) 

* 

, then  Xp 

is 

g(n_1)  . 

* * * 

Having  established  the  equations  for  the  CLLP' s for  X1  , X2  and  X^ 
below  we  may  therefore  concentrate  on  one  single  relation,  viz.  the  one 
arising  from  the  relation 

(3-3.1)  if  X is  then  X*  is  ; 

as  we  indeed  will  in  Chapter  4. 

Below  a and  b are  integers  satisfying  l<b<a<n,  and  T is 
some  tree  in  9^  . 

Case  1.  X* 


# # 

The  CLPP  of  X^  will  be  denoted  H^(z,w)  . Lert  T have  a and  b 
as  values  of  two  adjacent  left  path  nodes: 


The  tree 


* 

T 


resulting  from  T is  then: 


if  x < 1 : 


if  1 < x < b 


new  left  leaf 


no  new  nodes  on  left  path 


no  new  nodes  on  left  path 


no  new  nodes  on  left  path 
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if  n < x 


0k.  x 

' V J.  * 


Summing  over  the  entire  forest  we  see: 


new  root . 


(3.3.2) 


<(*,»>  - & f w1  * £ J»l  v" 


v _ ( b a+1  b+1  a-b  a+1  1 

1 <b  <a  <n  a’bl  n+1  n_1  Z W 


b n+l-a  a 

r + - — z 

n+1 


Case  2.  Tt 

The  CLPP  of  Xg  will  be  denoted  Hg(z,w)  and  we  recall  the  CLPP’s 
of  the  normal  p-tree  forests:  H^(z,w)  from  (2.4.17). 

As  in  the  previous  case  we  assume  T to  have  a and  b as  adjacent 
left  path  node  values: 

If  1 < b we  get  / 


If  b = 1 we  have 
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(a-2) 

where  Bv  w is  the  last  right  subtree  of  the  left  path  having  a-2 

( a-2) 

nodes  (a  > 2)  . As  Bv  ' is  appended  to  the  left  path,  we  will 

-ft 

have  the  new  left  path  of  T : 


left  path  of  B 


(a-2) 


Summing  over  the  total  forest  X , recalling  that  X has  the  BPP,  we  find 


H*(z,w)  = F r za_1  wb_1 
2 <b  <a  <n 


(3.3.3) 


V , a-1  a-2  . ..(a-2), 

L,  r (z  w + K:  '(z,w)) 

<a  <n  a’1  0 


(The  latter  formula  being  justified  by  the  conventions  made  earlier: 
H^(z,w)  = -z 
H^(z,w)  =0  .) 


Case  3*  X, 


In  thi3  case  we  could  use  "geometric"  considerations  as  in  the  two 

* 

previous  cases.  However  we  may  establish  by  means  of  (3-3.2)  and 

(3-3.3). 


Introducing  the  convention 


rajb  =0  if  (a, b)  ^ [(c,d)  | 1 < d < c < n} 
we  have  from  (3.5.2) 


a b 
z w 


Hl(2’U)  ' l<b<a<n+l  ^ (<b'1>Vl,b-l  * (-^Vl.b  * <ml->>ra,b> 


1 2 1 1 n+1  n 

ii+T  z w + 2 " • 


Inserting  this  in  (3 .3.3)  we  obtain 


(5.3.10 


+2<f<n  ('V’lt"oa‘1)(z’w))((a-1)ra,l+(n-a,Vl,l> 


b+l; 


+ 1 z*  wn"l 

n+1 


3.4  The  CLFP  of  . 

From  (3.3.I)  and  (3«3«^)  we  obtain  the  following  polynomial  identity 
for  the  CLPP  of  the  stationary  p-tree  forest  » using  the  notation 

(3.2.6): 
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( .U.l) 


v V1  Jn)  a b _ *(»), 


a = 2 b 


S . A™(z,w) 


51-  ^ 


a = 2 b = 


5TI  • s0  ( (a-1) + (n-a)  Vi, i}  ( + Hoa'1}  ( Z> w) ) 

a = 2 


1 n n-1 
+ — r Z w 
ml 


Here  we  recall 


,00 


(3.1+  .2) 


k a-1 

j:  z 

a = 2 b = 


L r 1 . 1 > a b 

A (kn^bTT^bT  + J 2 w 


r(o)  . 


Hi  = 0 


(k  > 2) 
(k  = 1) 


and  the  convention: 


(3.U.5)  T|«  - 0 


if  (a,b)  ((r,s)  | 1 < s < r < n} 


(3.^.1)  is  in  fact  a set  cf  M = 0|  ^ ~ ^ simultaneous  linear 
equations  in  the  M variables 


q(n) 

a,b 


(1  < b < a < n) 


.(n) 


The  uniqueness  of  the  solutions  follows  from  the  existence  of  Sv  ' , but 
we  could  also  prove  it  directly  from  (5*^-l) • 

The  solution  of  J.^.l  for  the  first  few  n 's  proves  to  be: 
b 

n = 2 
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We  will  therefore  settle  for  approximate  solutions.  Fortunately 
we  need  not  have  solutions  for  all  to  establish  the  quantities 

described  in  Section  2.4,  it  will  turn  out  in  Section  5 that  in  order  to 
establish  L , S , R , RL  and  C for  s|n^  we  need  only  the  values  of 
the  corresponding  quantities  for  (k  > 0)  and  the  's  and 

the  ' s . 

In  the  next  chapter  we  shall  deduce  from  (3. 4.1)  an  equation  for  the 

/ \ 

| ' s and  find  an  approximate  solution  for  them. 


r 


Approximate  Probabilities  for  the  Next  Last  Node  Value  on  Left 


Paths  of  S 


(n) 


U .1  Summary • 

In  this  chapter  we  will  prove  the  following  formulae  to  be  true. 
Proposition  U.1.1.  We  have 

(l+.l.l)  = jtft-l)  + 3n(n-l)  + ^,1  ' °(n  ) (*»  < t < n ) 


(4.1.2)  T|W 


(4.1.3)  7] M 


i+  £ On  . {8_+  4R(2n  i+  0 
9 3 n ^27  3 Hn  J n °1  2 

£.2  \ 20  fHn^ 

3 3 n+9n+0^J  ’ 


(2) 

H and  Hv  ' are  the  harmonic  numbers: 
n n 


H_ 


n n 

E \ 
k = l k 


H. 


(2) 


n i 

E 4 • 

k = 1 k 


The  0(f(n))  notations  should  be  interpreted  as  follows: 
g(n)  = 0(f(n)) 


iff  there  exists  a constant  that 
|g(n)|  < m| f (n) | 


for  all  n = 1, 2,  . . . 
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I 


During  this  and  the  following  chapter  we  will  make  extensive  use 
of  standard  formulae  from  combinatorics  and  discrete  mathematics,  referring 
for  example  to  [ 5 ] • 

Please  notice  the  difference  between  the  Tl£n^  's  above  and  the 
corresponding  probabilities  in  the  normal  p-tree  forest: 

t(t-l)  + n(n-l)  • 


4.2  Linear  Equations  Involving  Only  T)^n^  ' s. 
The  goal  of  this  section  is  to  prove 


£ z*’2  ^“1  (Sl2>n,(n_1)  S (z+1)n" 


a = 2 


(It. 2.1)  + Z (^(a-l)  + Tl^1(n-a))(n-a+l)(z+l) 

& b 2 


a-2 


♦ (^^a_1)  + 11i+li(n'a))(x(a"1)(z)  + y(a’1)(z))  * 


where  1 * ® 


and 


, .\  k r-1  , _ . s— 1 r-s-1 

a-2>  x(k>w  - r?e 


/v\  k r-1  / ..vs-l  r-s-1 

(k.2.3)  r"l(«)  . Z Z !)(«,) 

r =2  s =l 


for  1 < k (k  ■ 1 leaves  empty  sums,  being  0 ) 
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i 

t 

d _ . 


In  (3.4.1)  we  multiply  both  sides  with  (n+1) 


and  move  the  double 


sum  to  the  left,  obtaining  the  equation: 


n a-1 


L Z Z\6((n - (a-b-1 

i = 2 b c 1 


(4.2.4) 


= 2 


V"1  + £ ((a-l)T)<“>  + (n-a)^>1)(zawa'1  + H(a-1}(2,w))  . 

a — a 


We  introduce  the  new  quantity: 

(4.2.5)  o£l  = (n-a+2)(n-a+l)(  )t£"£  for  all  a,b  . 

The  left  hand  side  coefficients  then  transform  to  (provided  1 < b < a-2  < n-3)  , 

(nfi-b )t£l  - (a-b-l)71^1  - (n-a)^1#bfl 

^b^1* gatbfl^a-b“1) Vltbf^n-a) 

(n-a+1)  (n-a+2)  ( “"J£g  ) (n-a+l)  (n-a+2)  ( n^+2  ) (n_a)  (n-a+1)  ( n^1  ) 


(n-a+1)(n-^l) 


^ °a,b  ' °a,bH  " °a+l,bfl^ 


This  transformation  is  easily  checked  to  be  valid  for  the  cases  a = n , 
and  b = a-2  or  b * a-1  with  2 < a < n also. 

We  use  this  result  in  (4.2.4)  and  multiply  each  term  zawb  with 

(n-a+i)(n-ii) 

obtaining 


41 


£ ^ a b/  (n)  (n)  (n)  \ 

L E Z w {oil  - b+i  • Vl,b+1> 


(4.2.6) 


a - 2 b-1 


zV1  ♦ E ((a-l)T]^]+(n-a)Tl^1)((n.a+l)2awa-1  + K(a-1)(z,w)) 


where 


(4.2.7) 


K(k)(z,w) 


k r-1  , 

= £ £ ( 

r = 2 8=1^ 


iyn)(",iT)(n!!;*1)'v 


y>  a b/  (n)  (n) 

E Z W (<  ' - 0Ka  ' 


a =2  b =•  1 


(n)  - a(n)  - a(n)  ) 
a,b  a,b*l  a+l,b*l' 


v ay^  a (n)  y1  ay>J"  a b-1  <r(n)  v T „&-l  b-1  „(n) 
£ Lj  z w ctv  ' - Lj  izw  o'  / - £ £ z w a'  ' 


n a-1 


n a-1 


a = 2 b = 1 


a=3  b =2 


a = 3 b = 2 


(1 . i . a ) r i*«b  >> . ( 1 + 1)  e »<n’ 

L ” *>  J Kb<.<n  a>1>  ^ S'a-a  a>1 


so  that  by  putting  w = , followed  by  division  of  z*(z+l)  we  obtain 

from  (4.2.6) 


v a -2  , «n-2 

£ Z 0 1 - (Z+1) 

a =2  a,:L 


E ((a-l)71^+  (n-a)^^)  (n-a+l)(z+l) 


z(z+l) 


We  have  from  (4.2.7) 


K(k>(  ■ - ^ ) 

— 


k r-1 


■ ^ ,^(W.Cj(k.b|  + Tt^)(n-”1)(n-r-l)<2+1)S'1'r'S'1 


= X^(z,w)  + Y^(z,w) 


so  we  arrive  at  (4.2.1),  having  realized  from  (4.2.5): 


4.3  Properties  of  X^(z)  and  Y^(z)  . 

The  complexity  of  (4.2.1)  is  primarily  due  to  the  sums  involving  the 

functions  X^(z)  and  Y"k^(z)  as  defined  in  (4.2.2)  and  (4.2.3). 

In  this  section  we  sha.1.1  concentrate  on  simplifying  these  polynomials . 

We  will  make  use  of  the  following  differential  operator: 

(4.3.1)  <7i  = -(z+1)  ^ ; 0 < J 

so  that  d j applied  to  a function  f(z)  is 

tfjfU)  = jf(z)-(z+l)  . 

In  particular  we  will  make  use  of 

(4.3.2)  ^(z+l)1  = (j-i)(z+l)i  (all  i ). 

This  section  contains  the  proof  of  the  following  three  statements, 
all  valid  for  1 < k < n-1  : 
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(M.5)  an.2x(k>(^  - s a8‘2((") -(■>-*)<£) -<k)> 

s = 2 


(U.3.U)  a _Y(k)(z)  = £ zS'2(°) 

n 8=2 


(U.3.5)  C7n_2(x(k)(z)+Y(k)(z))  = k,(z)  - 


From  (k.2.2)  we  obtain: 


xm(2)  , s1 


8«1  <T-s)(k-B+l)  r = s+1  r-B-1  ' 


The  inner  sum  of  this  expression  may  be  written  as  a polynomial  in  (z+1)  : 


Lr-s-1,  n-s-1  \ 
* r-8-1 

r = s+1 


z 'if1  (“Ti)( r“®"1)(2+i)t(-i)r‘s"t'1 

r = s+1  t = 0 r_S_i  Z 


k-s-1  1 k 1 _ f i . 

z (z+d*  z (nT 

t = 0 r =t+s+l 


k-s-1 


,t,  n-s-1  N , n-s-t-2  s , .vk-s-t-1 


so  that 


v k^  k’®_1  (z+l)8+t-1(n-s)  f n-s-1  x / n-s-t-2  w .*k-s-t-l 

X'  \z)  + \ (k-s)fl-^ir  ( t k-s-t-l  ^ 

S = 1 t = 0 


*s  \ PJ»I'  '’VPflP  ■»-**» ^'^r  - 


, n-s-1 v , n-s-t-2  v 
1 t n k-s-t-l  > 


. n-6-1  , , n-8-t-l  \ k-s-t 
' n-s-t-1  ’ ' k-s-t  ' n-s-t-1 


, n-B-1  , , n-s-t-1  v k-s-t 
' n-s-t-1  ' n-k-1  ' n-s-t-1 


/ n-s-l  w k-s  s k-s-t 
' n-k-1  ' k-s-t  ' n-s-t-1 


, n-s  , k-s+1  . , k-s-1  v k-s 
n-k-1 ' n-s  ^ k-s-t-l  ' n-s-t-1 


so  that 


/.  \ k— 1 k— s— 1 / \ s+t— 1/  -\k— s— t— 1 . _ 

r(k)/_\  v (z+1)  (-1)  / n-s  k-s-1 

8-1  t-0  (n-^-t-1) { n-k-1  H k-s-t-l > 


s+t— 1 


Applying  an_2  to  (z+1)  we  obtain 


«(*■!) *H>“1  « (n-s-t-1) (z+1) s+t"1 


so  that 


i7n.2x(k)(z)  - if  ( -^(z+l)-1  k|)1  (z+l)t(-l)k-s-1-t(k-^1) 

S = 1 t *=  0 


8-1  k-s-1 


k-1  8-1 


y'  V (k-8-l)+(s-t-l),  n-s  Ws-1N 

.1  tTo  ("+-i)(  4 > 


k-s  . . _ k-1  , 

e z"-4-2  s > 

t-0  s -t+1  n k x t 


V, 


k£  ^[T  ^ (n^!1)(s;1))-(n^1)(k;1)-( £>(?>] 

t =o  L\.s  =t+1  J J 


i 2k“t‘2[(n-k+t)’(n‘k)(k-t^)-(kkt)] 

u = U 


Changing  the  summation  index  to  k-t  we  obtain  (U.3.3)* 
From  (U.2.3)  we  see: 


Y(k)(z) 


v (n-rH)  f r-s-1,  , \S-1/  n-s 


r = 2 r*<r“x>  8-1 


if  E zr  (a+l)  (n.^^) 


The  inner  sum  may  be  transformed  as  follows: 
r-1 

v r-s-1,  ^..xS-l,  n-s  » 

2 <2+1>  (n-rfl> 


r-1 

s-l 

8-1  x S- 

t )Z 

z 

E ( 

s = 1 

t =o 

r-2 

r-1 

Z 

r-t-2 

z 

Z 

t =0 

s =t+l 

r-2 

Z 

t =0 

r-t-2 , 

z ( 

n 

n-r+t+2 

r-2 

z 

t =0 

2 <*> 

• 

-1 v , n-s 


Using  this  transformation,  and  applying  we  obtain: 

a tW(z).  S zf  jSfajr  (J  )(».*■  (»1)  . Xs'1) 
n r = 2 s =0  r (r  s 


= Z 


[r^?  / n-1  v s r-f2  , n-1  x s-1  I 

Z n-(  )z  - Z n-(  ,)z 

8—0  8 s -0  81  J 


\ 


v>  n-r+1  _ , n-1  \ r-2 

rt2  FT73T 

V / n v r-2 
£ ( _ )z 
r =2 


and  (4. 3.U)  is  proven. 

From  (4.3-3)  and  (4-3.4)  we  find: 


^n-2(x(k^^  + Y(k)(z)>  = ^ zS_2(^)  -(n-k)  £ 2S"2(^)- 


s =2 


s =2 


+ (<7n  Y^(z))  v 2 Y(k)(z) 


= 2{an  Y^(z))  -2  Y^(z)  - (n-k)  — ~ 1 - l2*1.)*  ~ kz  -1 

z 


Now: 


{an  Y(k)(z))  -Y(k)(z)  = Y(k)(z) 


and 


U - <-«  - (-« 


k-1 
z 


(k-l)(z+l)k~2  (z-t-l)k~1  - 1 

z ' 2 

z 


k-1 


= (n.k)  iSL1) zl  . C2-*-.1} 


proving  (4.3-5) • 


4.4  Revision  of  Equation  (4.2.1). 

In  this  section  we  will  obtain  a simplified  version  of  (4.2.1), 


using  the  results  of  Section  4.3.  We  will  use  a new  notation  for  the 

unknown  quantities  : 

a,i. 
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(U.fc.l) 


n _ Tl(n  ) 

^a  " ^b-a,  1 


The  main  result  of  this  section  is: 

E (zr(r«-2)  - z1^1  r)  £ ( T )(  ^ ) n p^-l)1 

r = 0 s*0 


(U.U.2) 


{=■')- 
\ r = 0 / 


(n  - (n-l)pn_2)z' 


n-1 


+ £ (zr(r<-2)  - zr+1(r+l))(pr(n-r-l) +pr_1  r) 

r = 0 


As  before  we  use  the  convention 


o - 0 ifa<0orn<a. 


We  will  split  (1+.2.1)  into  three  sums 


a,  1 


n-2 


(*.M) 

S1 

n 

= £ 
a = 2 

(4.U.1*) 

S2 

= (z+l) 

(U.U.5) 

S 

n 

= £ 
a = 2 

so  that 

(U.U.6) 

Si  = 

(U.U.5)  S?  = £ (T]^(a-l)+Tl^:L(n-a))((n-a+l)(z+l)a-2  + X^-l)(z)  + Y(a-l)(z)) 


S2+  S5 


We  also  introduce  the  notation 

(fc.U.7) 


s * a+2 


In  (U.3.1)  we  defined  the  differential  operator  a ^ • The  corresponding 
integration  operator  will  be  denoted  . We  have 


kQ 


(U.t.8) 


a.  T,  f(z)  = f(z) 

O 


(14.4.9) 


T.  0 = C(z+1)^  (C  constant) 

d 


(U.U.1Q)  TjCz+l)1  = (z+1)1  + C(z+l)'j  (C  constant  and  i^j) 


We  will  apply  the  operator 


(fc.4.11) 


1C  = z a 3 a r 
n n-1  n-c 


to  (1+.2.1),  and  then  rearrange  the  polynomials  using  (z+1)  as  variable. 
For  Sp  we  find 


(U.U.12 ) 


UO  S2  = Cz2(z+l)n'1 


= C^z+l)1^1  - 2(z+l)n+  (z+l)n-1) 


where  the  constant  C is  assumed  to  represent  the  integration  constant 
for  the  entire  equation. 

For  S1  we  find 

S = z I2  (z+l)b(  a;2)(-l)a-2_b("'2)T)^  n(n-l) 

1 a = 2 b = 0 b a"2  a,i 


n-2 


- Z <z.l)b  Z (":2)n(i-l)(“-|-b)(.l)a-2-b  Vn> 

p =0  a =V>*0  D a ^ 0 a>X 


a = b+2 


n-2 


£ (z+l)a(  n~  )n-(n-l)K 

- Cl  cL 

a =0 


and 


h9 


* S1  - .=  z2  C (73'a) ) <-«*<  ° :2  >-<«-»«» 

leading  to 

n 2 

(4.4.13)  It  S = E ((z+l)a+2 -2(z+l)a+1+  (z+l)a)(n-a-2)(n-a)(  n_1)n  K 
a = 0 a a 

For  Sj  we  will  have  to  involve  ourselves  in  more  complicated 
calculations,  (see  (4.3.4)  and  (4.3*5)). 

ta( (n-a+1) ( z+1) a"2  + Xa_1( z) + Ya_1( z) ) 


:z+i)a_2-i 


(n-a+2) (n-a) (z+l)a~2  z2  + 2 E zS(n) 

s = 2 5 


a-2)(z+l)a-5  (z+l)a'2 


J)> 


= 2 Z)  zS(  a ) + (n-a) (n-a+2) (z+l)a  2 z2  - (n-a+2) (z+l)a~2  z - (z+l)a"^ 


+ n z + ( z+1)  - 2n  z - 2 


Hence  we  may  write 


It  S,  = U+  V+  W 


where 


(4.4.14)  U = E x u 

a = 2 a a 


(4.4.15)  V = E x v 

a * 2 a a 
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(1*. 14.16)  W = £ X w 


a =2 


a a 


where 


a-1 

u =2  £ zS(n) 

a s =0 


v ■=  (n-a)  (n-a+2)  (z+l)a  z2  - (n-a+2)  (z+l)a  z - (z+1)  1 


w = n z + ( z+1)  - 2 n z - 2 
a 


Now 


u = 2 Z1  zS(")  = 2 S1  £ (z+l)t(-l)S-t(”)(") 

a s =0  S s =0  t =0  S t 


= 2 E1  (z+dV)  E1  (^)(-i)s_t 

t = 0 s = t 

= 2 i1  ( z+i)1  ( j j ) ( -l) a_t  ■1 

t =0  t a t’1 

u — 0 

- 2n(£)  E1 

O =S  0 


so  that 
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u = 


n 

£ x u 
a = 2 a a 

n 

I 

a =2 

+ T,  ^n) 
a =2 


^ l<n-a)a.(£)  Zo  ^ 

■ 2n(”-1>  [ ,f  2 X ti<  ”-2  > 1 ( ^ > (-«,-‘-1(«i)t  tktet 


2n(n-l) 


n a-1 

2_i 

a = 2 t 


= 2n(n-l) 


a = 2 t 
n-2 


■ ^(rs  (z+1)t+1<”‘2)  -*<»-!>#> 


And  hence 


(J..U.17) 


n-2 


U * 2n  E (z+l)a+1(n21)Ka-2(n-l)^ 


a = 0 


For  the  sura  W we  find 


n 

W = 72  x w 

a = 2 a a 


t X (-  1Z+Z-1) 


a =2 
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= ( - nz+z-1)  E Tl[n^(a-1)+  E (n-a+1) 

L a = 2 a,J-  a = 3 a-’± 

= (-nz+z-1)  £ E n " 


( n ) 

Knowing  that  E 7p  ' = 1 we  obtain 
a =2  a,± 


(U.U.18)  W = (n  - (n-l)T)2^1)((n-2)  - (n-l)(z+l))  . 

Applying  to  to  (4.4.6),  inserting  (4.4.12),  (4.4.13),  (4.4.17)  and 
(4.4.18)  we  obtain  equality  between  two  polynomials  where  the  maximum 
exponent  of  (z+1)  is  (n+1)  , occurring  only  in  to  S2  (4.4.12).  Hence 
the  integration  constant  C = 0 and  we  have  transformed  (4.2.1)  to  the 
equivalent  identity: 


(4.4.19) 


n-2 


E ((z+l)a+2  - 2(z+l)a+x  + (z+l)a)  (n-a-2)  (n-a)  ( n_x  ) n K 
a = 0 a a 

- L (z+l)a+1(  “I1  ) 2nK  -2(n-l)^n|  + £ x v 

a = 0 a a £f,Xa-0aa 

+ (n-(n-l)T|M)((n-2)  -(n-l)(z+l)) 


. a+1 


n-1 


where 


x = 
a 


(O-1)  * 


and 


va  = (n-a) (n-a+2) (z+l)a_2  z2  - (n-a+2) (z+1) a"2  z - (z+l)a_1 

= (n-a)  (n-a+2)  (z+l)a  - 2(n-a+l)2(z+l)a  i'+  (n-a)  (n-a+2)  (z+1)* 


(n-a+1)  (z+l)a-1+  (n-a+2)  (z+1) a-2 
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In  (U.U.19)  "the  first  sum  on  the  right  hand  side  is  moved  to  the 
left  hand  side,  and  we  use  & throughout  the  identity  to  simplify 
the  terms: 

((z+l)a+2  - 2(z+l)a+1+  (z+l)a)  (n-a-2)  (n-a)  -2(z+l)a+1] 

= Tn( (z+l)a+2(n-a-2) (n-a)  - 2(z+l)a+1(n-a-l)2  + (z+l)a(n-a-2) (n-a) ] 
= [ (z+l)a+2(n-a)  - 2(z+l)a+1(n-a-l)  + (z+l)a(n-a-2) ] + C(z+l)n 

= z[ (z+l)a+1(n-a)  - (z+l)a(n-a-2) ] + C(z+l)n 

for  some  constant  C . 

Furthermore 


^nva  = (n~a+2)(z+l)a  -2(n-a+l)(z+l)a-1+  (n-a)  (z+l)a“2  - (z+l)a‘1+  (z+l)a-2 

= z((n-a+2)(z+l)a-1  - (n-a+l) (z+l)a_2) 

(neglecting  the  integration  constant) . 

Application  of  j;  to  (U.U.19)  hence  yields 

L z[ (z+l)a+1(n— a)  - (z+l)a( n-a-2) ] n ( n'1 ) K 
a =0 


(U.U.20) 


- n * (n-(n-l)T^">)(^  . (*♦!)) 


+ L (Tla?i(a_l)  + Tla+l,l(n"a))((n_a+2)(z+1)a"1  " (n-a+l)(z+l)(a-2))z 
+ C(z+l)n 


The  coefficients  of  zn  are  seen  to  be 

(n-(n-2))n(^)Kh_2  = (^“{(n-l)) -(n-nf2)  + C . 


5»* 


Now,  following  ( ^ ^ . 7) 


_ , n-(n-2)-2  v_(n),  ,»n-(n-2)-2  _ _(n) 

n-2  " (n-(n-2)-2;T>n,l(  j " %1 


so 


C = \#1[2n(n-l)  -2(n-l)]  = 2(n-l)2  T]^ 

Going  back  to  (4.2.1)  easily  gives  us  T]^n^  : 

Tl^(  n-2  )n(n-1)  = 1 + 7'n”i(n*l)(n"n+l) 


(n-1)2 

and  hence  C = 2 in  (4.4.20). 

We  insert  this  last  result  in  (4.4.20),  divide  by  z , and  then 
change  our  variable  from  (z+l)  to  z , obtaining 


n-1 

z 


a =0 


(4.4.21) 


Recalling 


n-2 

Z 


a =0 


[ za+1(n-a)  - za(n-a-2)  ] n ( n"1  ) K& 

= Z + ^i+l, i(n-a) ) ( (n-a+2) za_1  - (n-a+l)za'2) 

+ 2 - (n- 1>  . 

the  definition  of  the  p 's  in  (4.4.1)  we  find  from  (4.4.2) 

El 

(za+1(n-a)  - za(n-a-2) ) n ( n‘1  ) K& 


n-2 


n 


Z («^(n-a).«*(n-a^))n(n;A)  Z (^a‘")pn  8(-l)! 

a =0  a s=a+2  s a_<f  n s 


n-2 


Z (za+1(n-a)  - za(n-a-2))  n ( n"”^a)  Z ( n'a"2  ) p8(-l) 


n-1 


n-a-2 


n-a-2 


, n-s-a-2 


a =0 


s = 0 
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n-2 

E 

a =0 


, n-a-1 . 

(z  (a+2) 


n-a 


■2  , n-1  v 


a 

E 

B =0 


(:)ps(-Da- 


and 

2 (^{(a-i)  + ^a+1, 1(n-a) ) ( (n-a+2)  za-1  - (n-a+l)za‘2) 

= E ((a+2)zn  a 1 - (a+l)zn"a_2)(p  /’n-a-l)  + p a) 
a =0  a a+J- 

Inserting  the  two  last  results  in  (4.4.21),  dividing  by  z11-1  and 
finally  changing  the  variable  to  l/z  we  obtain  (4.4.2). 


4-5  Series  Expansion  of  the  p 's. 

a 

The  polynomial  equation  (4.1*. 2)  contains  n equations  and  the  (n-1) 

variables  (pQ, . . ., Pn_2)  • However,  by  putting  z * 1 we  will  see  that 

the  equations  are  dependent.  Furthermore,  it  is  not  hard  to  see  that  the 

1 

equation  obtained  from  the  coefficients  of  z may  be  ruled  out, 
leaving  an  independent  set  of  linear  equations. 

In  this  section  we  shall  obtain  series  expansions  for  the  p&  's, 
making  it  possible  for  us  to  obtain  approximate  solutions. 

The  following  facts  are  trivial. 
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(4.5.1) 


0 < fa  < 1 

0 

< a 

< n-2 

p.  - 0 

a 

< 0 

or  i 

n-2 

^ Pa  = 1 
a =0 


(n-l)‘ 


We  shall  prove  the  following  proposition: 
Proposition  4.5.1.  Define  for  1 < t < n-2 

(It. 5.2)  a<°)  . _S±2_ 

v t 3(n-l)  ( n v 

v t+1  ' 


(*>•5.3)  a'"1’  . 1 


Then 


i (*)k(kKL)a(r> 
t(t+1)(Ul)  k=1  -3=1 


(,.5.«  . I of  TF4Lly 


(^•5-5)  Pt 


J 


-H*  2 

(n-ir  r=0  1 


(4.5.6)  0 < s£r)  < b^0)Q  J 


1 < r 


(0  < r) 


(0  < r) 


1 < t < n-2 


1 < t < n-4 


The  constant  ^ i8  uniform  for  1 < t < n-4  , and  is  not  very 

well  optimized.  As  we  shall  see  later,  (4.5.6)  does  not  hold  for 
t = (n-2)  or  (n-3)  . 
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Proposition  (U.5.2)  below  gives,  for  each  t = 0,1,2, .. .,n-2  , 
as  a linear  function  of  c1, pg, ...,pt  1 • 

Proposition  U.5.2. 


Ci)  ( t-1  \ 

(U.5.7)  p = — jP ( O + £ P 3 ) 

1 t u = 0 u a,u  / 


0 < t < n-2 


where 


„ _ 1 2 1 . 1_. 

t 3 (n-t)  (n-t+1)  n(n-l) 


0 < t < n-2 


and 


t,u 


(u>  + <«Jl><*2> 


Cl>  (uf2)^-U-2) 


- (ufl)(u+2)  £ 


(r) 


r =u+l  r.(i+l)(pf2)(  £j) 


(0  <u  < t-1  , 1 < t < (n-2)) 


Solutions  of  equations  like  (4.k.2)  often  involve  one  or  more  cleverly 
selected  substitutions,  in  our  case,  the  following  sequence  of 
substitutions  are  not  unnatural  choices: 


(U-5.8) 


ct  = (n-t-l)pt  + t pt_1 
dt  = (t*2)ct-t  ct-1 


■t  ■ z • 

j-o  J 0 
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The  direct  correspondence  between  the  e 's  and  the  c 's  is 

t t 


to  be: 


Bt  = Z (-i)t"J(^)(j+2)c  - z (-i)t_J(*)jc 

J-0  J J j=0  J J_1 

Z (-i)t”J((  ^ )(J+2)  + ( A )(J+I))c, 


A =0 


= (t+2)  Z (-l)t_Jv^) 


j =0 


J 'A 


= (t+2) 


r Z (-D*^* 

L j =o  0 


) (n-j-l)p  + Z (-l)1'^*)  Jp. 
J j =0  J 


- (t+2)  S Pj(  J)((n-j-l)  -(t-j)) 


j =0 


so 


(U.5.9)  e = (t+2)  (n-t-1)  E (-l)t'J(t)p. 


j =0 


J ,r\J 


From  ( 1+-5-9)  we  easily  deduce 

C.5.1D)  Pt  . z 


0 < t < n-2 


Inserting  (4.5.8)  into  (4.4.2)  we  obtain 
n-2 

Z (zr(rf2)  - zr+1(r+l) ) (p  (n-4-1)  + p r) 
r = C r'x 


n-2 


Z (zr(r+2)  - zrfl(r+l))c 


r =0 

n-2 


Z z ((r+2)c  -r  0 ) - z11-  (n-l)c 

r = 0 r-+  n- 


seen 
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L zr  4 - zn~1(n-l)t 


^ *r  £ ( £ )«  - 2n"1(n-l)e 

r =0  t=0  r r 


and  from  (^.5.9) 


^l)(;)nps(-l)3 


t n_1  \ 

(rH>n 


-oaain  - to 


L (^2)-^  r)(^)^er 


t 2r  + zn_1((n-l)p  -n  - (n-l)c  ) + "if  zr  £ (+r)e+ 

r=0  n n'2  r=0  t=0  * * 


from  which  we  obtain 


2+  E <?>et 
t =0  t t 


0 < r < n-2 


when  neglecting  the  teiros  z11-1  . 

Multiplying  each  equation  (l4-.5»ll)  with  r-(r+l)  and  summing  from 
9 through  s (0  < s < n-2)  we  obtain 

r?0  («l)r(rfl,er‘r?0 

s^  sr 

= 2 D r(iM-l)  + E £ (f)e,  r(r+l) 

r =0  r = 0 t = 0 Z Z 
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2 

The  first  sum  being  j s(s+l)(s+2)  we  see 


(4.5.12) 


2 s+2 

e = — + 

s 


* n ' n — S S (t)r(r*-i)et 

5 (s^)  s(s+l)(  J\,  ) r = 0 t=0  t 1 


s+1 


if  (1  < s < n-2)  . 


As  (it. 5 .9)  gives 


e0  = n-1 


we  see  that  (4.5.12)  leads  to 


7%^*  ~7~7rnr7  A A 


( sii  ) '(■*«(  ^1)  t'1 


From  the  definitions  (4.5*2)  and  (4.5.3)  we  see  that  if  we  define 


u 


(A 


2 a 


(a) 


s „ s 

a =0 


(0  < r) 


we  find 

,0*1) 


r+1 


u' 


* a0  + L,  “s 
a = 1 


(a) 


~ + L-J  i—J  \ . 

s*(s+l)(  s“i ) k«l  j=l  J 


2 2 ( . )k(kt-l)  2 a 


a =0 


(a) 

1 


so 


• U(rfl)  = i 


s s 


t t , V 

2 2 ( , )k(fcn)(e,  -u(.r0 


— <-*  \ .1  / *»••**■/  v '*'4 

s(s+!)(  s+i)  k-1  j-1  J J J 


As 


(0)  2n  s+2 

e -uv  ' - e - -77 — rr  > 0 

s s s 3(n-l)  / n , 

^ s+1  } 
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induction  shows 


(U.5.1M 


u(r)  < 


(0  < r)  (1  < s < n-2) 


u^  , u^  , ...  is  hence  an  increasing  bounded  sequence  and  therefore 
s s 

converges  for  all  s = 1,2, . ..,n-2  . The  fact  that 
(U.5.15)  lim  u<r)  = e 


(1  < s < n-2) 


p —4  00 


follows  from  the  fact  that  uv  ' satisfies  (4. 5. 12): 

s 


(®)  v (r) 
uv  ' = I,  ay  ' 

s _ s 

r =0 


- “o  * aS 

r =0 


(ih-1) 


s r 


= ao  + 


~ JT — £ £ (f)r(rH)u 


s(s+!)  ( s+i ) r =1  t*l 


(“) 

s 


From  (4.5.10)  we  find  for  1 < t < n-2 

t (!) 


Pt  " 0 CJ  (j+2)(n-j-l) 


- n-1  2-(Ll)  + ^ (j+2)(n-j-l) 


£ a<a> 
a =0  * 


■S  * = ^ 


(n-1)  a = 0 


proving  (4. 5. 5)  of 
Now  as  siome: 


Proposition 
1 < t < n-2 


4.5.1. 

, we  find  from  (4.5.3) 
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,(r+l)  _ 


i— — £ a(r)  E ((k‘1)+(^))k(k+l) 

t(t+l)(  " ) j=l  J k = j J J 


£ a!r)((  £f)(d+i)(d+2)  + ( ^+1  )J(J+1)) 

t(t-l)(  " ) d-1  j J+"  3+2 


(fV  J 


r a(r)  (A.  ft-lx  + J_  (t-1^ 
j V J+3  ( J ; j+2  1 j-1 j J 


hence 


<*•’•“>  41"3  < ,Mr,c;>& 


<»v  j*1 


Now,  from  (4.5.4)  we  easily  deduce 

(4.5.17)  at(r)  = (fc+2)(n-t-l)  £ ( ^ )(-l)t_J  &$r) 
t j =1  0 J 


t t 


s ajr><l>F2  ■ .s.  s.  <-»3~k  ir) 


provided  1 < t 


k = 1 j =k 


j ''  k 


= £ ^(M  ^ (tTk)(-l)j(n-k-l-j) 
k = 1 K j =0  J 

t t “ k 

= £ &^r)(h  £ (n-k-l)(t:k)(-l)J 

k = i k |_j  =o  J 

- Z?  (t-kH^X-D^ 
j =0  J J 

* (n-t-1)  + t 


r we  wasllj  see 


&(r)  <,  i.(r) 

t-1  t 


. , 1.  >•  : < t < n-  w>  find  from  (4. 5*16) 

(I+.5.I8)  b|r)(n-l) 


( “ ) 
' t+1  ' 


t = 2,3,  • . - ,n-2 


(1  < t < n-2) 


(the  latter  formula  easily  being  checked  for  validity  when  t = 1 ). 
From  (1». 5. 2)  and  (I4.5.I)  we  find 


t 

V 

2n 

j+2 

L.  ~ 

3 = 1 

2n 

(n-1)  ( 

1 

n ) 
J+l; 

1 

n(n-l) 

( n;2 ) 

2n 

1 

1 

l(n-l) 

n(n-l) 

' ? ) 

2n 

1 

“ 1 1) 

( j+2) (n-j-l) 


'l  tin*?) 

i =1 


(?) 


A (S^)(J11) 


- ( 


n;2>] 


( nr ) 


- 1 


and  hence 


(t.5.19)  6. 


(0) 


2n  f 1 _ 1 ^ 

JTn-37  ^ ( n-t) (n-t-1)  n(n-l)  J 


'sing  <'14.5. 18)  in  C U . S . U)  we  get,  when  1 < t < n-1*  : 
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6ty  < ,Vi  7^77  6j°)(n'1)  i>2)A-j-i) 


f"  £ — - £ — — 

j = 0 (n-j-l)(n-j)(n-j-l)(  J+1  ) j=0  ( ^ ) (n-j-l)n(n-l) 


< t n £ — “ 

j=°  ( j+1  )(n-j-l)(n-j-2)(n-j-3) 


- 2 f,  \ 1 

5(n-l)  [_  (n-t-1)  (n-t)  J 


fn4un4-jujtl. 

2 1 £ l .j  Mn-4-tJ(  1 / 1 1 

3 n ( n74  ) j = 0 n(n-l)(n-2)(n-3)(n:'+)  ' (n-t-1)  (n-t) 

^ J 


( \'2  ) 


n(n-l)(n-2)(n-3)(n;4) 


n-l)  n-t -l)  n-t 


2 r 1 _ i “| 

3(n-l)  [_  (n-t-3)  (n-t -2)'  ~ (n-t-1)  (n-t)  J 


■nee  we  find 


(4.5-20) 


bf1)  < % 5 
t n t 


We  will  use  this  as  a starting  point  in  an  inductive  proof  of  (4.5.6) 
(4.5.20)  shows  (4. 5.6)  to  be  true  for  r = 1 . Suppose  it  is  true  for 


r = x . Then,  from  (4.5.4)  and  (4.5.18)  we  find 
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,(**1)  < £ JJiSi  (n-l)  6<x> 


J-l  <£) 


W ‘ i> 

j ( j+2) (n-j-1) 


< nx  (-1>.  x^rfeiy  8f 

3 " ' j+l ; 


aruKo)  ■ ar\ 


as  in  the  proof  of  (4.5.20).  This  is  (U.5 .6 ) and  hence  Proposition  4.5.1 
is  proven. 

We  proceed  to  prove  Proposition  4.5.2. 

Inserting  (4.5.12)  in  (4.5.10),  using  (4.5*9): 


e0  \ T) 
ca  = 2(n-l)  + r^1  ®r  (r+2)(n-r^ 


en  a 


a r s t 

+ £ r E £ 

r = 1 s = 0 t =0  u = 0 


:> 

2 

r+2 

(n-r-l) 

5 

( n ) 
1 rfl  ' 

( a) 
' r 

1 

:rt2)(n-r'1)  (rt)r(  " ) 4 


( f )s(s+l) 


.(t+2)(n-t-l)(-l)t  U(^)pu 


From  (4.5.10)  we  see 


c0  1 
p°  ' ^ ^ 


and  from  the  proof  of  (U.5.I9)  we  have: 
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T ^ r ) 2 (r+2)  2 f 1 1 > 

r = 1 (r+2)  (n-r-1)  3 / n . 3 ^ (n-a)  (n-a-1)  ' n(n-l)  J 


so  we  obtain: 


'>•5.21)  pa  3 (’(n-a)  (n-a-1)  " n(n-l))+  ^^2  + T 


where  T is  the  last  sum  in  the  previous  formula  for  p • As  T =0 
a a a 

when  a = 0 (sum  being  empty),  we  see  that  (4.5.21)  is  valid  for  a = 0 
also . 

To  evaluate  T we  shall  consider  the  stuns 
a 


aur=  E Z (t)(s(s+l))(t+2)(n-t-l)(-l)t-u(M 
’ s =u  t =u 


so  that 


(a) 
V r / 


a r 

£ £ 

r = 1 u = 0 r(iM-l)(r+2)n(  ^ ) P&  ^U,T 


Now: 


i (t+2) (n-t-l) (-l)t-u( ^ ) ( ? ) 
t =u 


(®)  £ (t+2)  (n-t-l)  (£“)(-.  l)t_U 

t = u u 


s-u 


(U)  L ( (n-u-1)  (u+2)  + t(n-2u-U)  - t(t-l) ) ( s~u  ) (-1) 
t = 0 
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I 


-l)(u*2)  iT  (S:U)(-l)t  -(n-2*-k)(S-u)  I ( B:U"i)(-l)1 
t =0  t =0 


- (8-u)(8-U-l)  LU  (E:U'2)(-l)t_2 

t=o  t2  J 


and  hence 


(n-u-1) (u+2) 

if  s = u 

-(n-2u-U) (u+1) 

if  s = u+1 

-(u+2) (u+1) 

if  s = u+2 

a 

V - 

(a) 
V.  j.  ! 

P-  n_l  Mv  r 

= 1 r r(r+l)(r+2)n(  ^ J ) r,r 

a 

. y . 

( a) 

+ pr-l 

r = 1 

r(p+l)(r+2)n(  ) 

a 

+ z — 

( a ) r-2 

r z 

r=2  r(r+l)(r+2)n(^) 


a ( j 

£ p — T"  r(r+l)(n-r-l)(i+2) 

r = 1 r r(rH)(r+2)n(  J^) 


+ z 


r(r*l) (r+2)n(  “t ) 


^ZT~  ( 0*1)  (*+2)  (*+!)(*)) 


♦ z — 

r = 1 r(rH)(r*2) 


a ) r p_i 

TT“  ^ pJu(u+l)(n-u-l)(u+2) 

•2)n(l,)lu.o  u 


- (u+1)  (u+2)  (n-2u-U)  (u+1)  - (u+2)  (u+3)  (u+2)  (u+1)  ] 


Wp2»  /«  * ci> 


a ( “)  a 

V r / 

PT  — ■ — + 


E c — + E P . 

r / n x , pr-l 


r = i (^  r-1 


(;)m 

( r+1 ) (n_r-l) 


a-1 


- ^ P„(u+l)(u+2)(n+2)  E 


u =0 


u 


(?) 


r=u+l  r(rH)(rt2)(^)n 


Inserting  this  in  (4.5.21)  yields 


- if  1 , 1 > + 1_ 

3 ^ (n-a-1)  (n-a)  n(n-l)  J 


2 pa  / n 


( 11  ) 
^ a+1  ; 


n(n-l)‘ 


a-1 

+ E p 


( ) a-1 

r + E 


( 


r=°  r (^1)  r=0  r ( r+2 ) (n-r-2) 


a-1 


£ pu(u+l)(u+2)  E 


(a) 

v r ' 


u =0 


r=ufl  r(r+l)(r*2)(^) 


and  we  easily  see  that  we  have  proven  Proposition  4.5.2. 


4.6  Proof  of  Proposition  4.1.1. 
From  (4.5.6)  we  find 


° < £A0)<M0) 


r = 1 


1 < t < n-4 


So,  bringing  in  (4.5.19)  together  with  (4.5.5)  we  find 

1 


2n  1 

pt  3(n-l)  (n-t) (n-t-1) 


3(n-l) 


2 + tj.  (1  < t < n-4) 


where 


69 


0 < t. 


. 2n  ( 1 1 \ 5 

5(n-l)  (n-t)  (n-t-l)  " n(n-l)  ) (n-5) 


so,  as  pt  = 


we  find 


n)  - 2 _ + Tl(n)  — • M.  4 < t < n-2  , 0 < M.  < M 

,1  it(t+l)  3n(n-l7  + n t - - t 


r(n) 
t 


where  M is  some  uniform  positive  constant  (at  least  less  than  26  )• 
This  proves  the  first  statement  of  Proposition  4.1.1,  as  the 
formula  for  t = n-2  is  trivial. 

(4.5.6)  is  not  valid  for  t = n-3  or  t = n-2  , so  we  have  to  treat 
these  two  cases  separately. 

We  introduce  the  notations : 
n-2 


(4.6.1) 


,(r) 


St~'  = S &t 
t t-1  * 


(r) 


(0  < r)  1 < t < n-2 


f r) 

and  shall  concentrate  on  first. 

We  have 


(4.6.2) 


3<*>  = 
n-2 


,(r) 


n-2 

t 

= £ 

E 

t =i 

j =i 

n-2 

a<r)  - 
j 

= £ 

j =1 

n-2 

aW, 

- E 

j =i 

,<*) 


(t> 


,n-1) 

\ A4.-\  / 
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* 


We  find  from  (4.5*3) 


5(r+l) 


Z T 7 ^ C*^r)  [k(let-X)  ( ) + (le«-X)(le«-2)  ( ) ] 

J-l  (tHl)nJ(j+l)(  k«l  k k+2  k+3 


z UDfi  n£2  + nv2  (3-1,1 

= 1 n \ l_k+2  =k  ( k-l)  k+3  17/.  ( k ) J 


, % n_2  r - 

(4.6.3)  S^1)  = £ — I — ( n'2)  + — ( n’2)l 

n-2  k = 1 n [_  k+2  k ' k+3  [ k+1  ' J 


Inserting  (4.5.2)  in  (4.6.3)  we  find 


» 2 

V 1 2n  k*2  r 1 / n-2  x 1 , n-2  x "1 

_i  n 3(n-l)  . n . I [k+2j  ( k ’ TF5T  ( k+1 } J 

^ k+1 ' w 


2 r nv2  (k+1) (n-k-1) 
n ( n-l)  ' 


(k+2) (n-k-1) (n-k-2) 
(k+3)n(n-l) 


and  eventually 


(4.6.4) 


2 (n+l)(n+2) 

3 (n-l)n 


1 . JjL  + _A_]  . it  (n-2) 

2 n-l  n-l J 9 n(n-l) 


Similarly,  as 


. —]  n E S ( J)J(d+l)  T7~ty 

t(t+l)(  “ ) j ,1  k = l k 3(n‘1'  (,n 


3(n-l)t(t+l)(  “ ) j 


( ) 
{ k+1 ' 


n-k-1 * , k+2  . 
n-i-1  2 ; 


- 2 ( n-i  ) 
n k j ; 
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4n  (m-1)  (rt+2)  y j(j+l) 

3(n-l)  t(t+l)(  n ) j=l  (n-jHn-j+l)(n-j+2) 


- -2-  a(°) 

5n  t ' 


we  find  from  (4.6.3) 


(2)  _ 4(rtfl)(rH-2)  f n£»2  / n-2  } _1_  ( n-2  A 1 

n-2  ‘ 3(n-l)n  1 t"1  t+2  * t ' t+3  K t+1  >)  t(t+1)(  n 


j =1  ( n - j ) ( n-  j+ 1)  ( n- j+ 1)  J 3n  bn-2 


UC f n£f  £ / . — _ 

3(n-l)2n  \t=l  j =1  (n-j)(n-J+1)(n-j+1) 


( lg± 

\ MU 


t(t+2)  t(t+l)  (t+3) 


3n  n-2 


After  tedious  computations  we  find 


(4.6.5)  s£l  = 


17  . 1 ( §2  „ & _2_  16  88 

= ' 9Tn-lT  n(n_i)2  V 27  9 ‘ 27n  “ 


Hn  : 

n-1)2  V5  n 


n(n-l) 


+ Tn+T+  9^+  9(^31“^)  * 


2.  it 

3 


14  . 4 , 32  16 


We  shall,  however  use  approximations  and  write 


(4.6.6)  S 


w , j. I 0fH-s 


n-2  3 3 n 91* 


"Z 

n 


<«•« 
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we  will  use 


In  order  to  find  approximations  for  6^2  and  6^2 

n-2  n-2 

the  following  formula 


IU.6.8)  s'1:1'  . E,(r>  . S(r>  * Sai  s'"1)  . 

v ' n-2  n-2  (n-1)  n-2  n-1  n-2 


From  ( U . 5 . U)  we  have 


= if  a!r> 


( nf  ) 

;j+2)('n-j-l) 


nv  Jr),  n-1'  1 

, , j ( j ‘ ( .1+2)  (n- 


and  similar  to  the  proof  of  (4.6.3)  we  find 


>1) 


n 2 

1 . V a(g)  / n-1'  1 , n-2  ' kfl  , n-1 ' 1 .n-2'] 

n(n-l)  “1  j [_k+2  ' k+1 ; k+2  ' k ' k+3  ( k+2  ' k+3  ( k+1  > J 


( n_1  -i  1 n , n+1  1 2n-l  if  1 , n-2 . 1 , n-2  . "N 

1 k ’ (k+2)  (n-1)  " (n-1)  K k+2  ' n(n+l)  (n-1)  n ^ k+2  ^ k ' k+3  [ k+1 } J 

1 k ( n_1  \ + 1 / n-2  >.  k+1  , n-1 ' 1 , n-2  , "] 

' n(n-l)  |_  k+2  ^ k+1  ; k+2  1 k ’ k+3  [ k+2  ’ k+3  ( k+1  ‘ 


( n_1  1 r 1 n 2n-l  n-k-1  k(n-k-l)  (n-k-1) 

k (k+2)  [_  (n-1)  (n-1)  (k+1)  n(n-l)  (n-1)  " ( k+1) n( n-1)  " j^.-g2 

, n-2  x 1 2n-l  n-1  k+1  1 “1 

' k+l;  k+3  [_  (n-l)n  ' Tk+2l  n(n-l)  ' n('n-i)  J 


, n-1  ^ 1 (n-k-1) (n-k-2)  , n-2  s 1 

' ( k ' k+2  n(n-l) (k+1)  ' ( k+1  > n(k+2) 


= 0 . 


So,  according  to  (4.6.2),  (4.6.3)  and  the  two  formulae  above  for 

n-2 

and  we  have  (^*6-8). 


itfMMn 


From  (4.3-19)  we  have 


n-1)  (n-t-1) (n-t 


3(n-l)‘ 


1 < t < n-2 


and  also 


(U.  6. 10)  Sj; 


2 . _L  + 0 

5 3n  °l2 


From  (4.6.6)  - (4.6.8)  we  then  find 

S(1)  = 8(°)  _ _D_  s(0)  + 2n-l  (1) 
n-2  n-2  n-1  n-2  n-1  n-2 


giving 


tl  - t 


and  similarly 


5^2)  = &(1)  _ _n_  s(l)  2n-l  (2) 
n-2  n-2  n-1  bn-2  + n-1  Sn-2 


.12)  &(2)  - £ J.lI+0fM 

' n-2  3 n 9n  °l  2 I 

From  the  above  formulae  (4.6.6),  (4.6.7),  (4.6.9)  - (4.6.11)  we  easily 


,(0)  1 1 

3n-3  = 5 " n 


s*1!  . | !!s.g+o[i 

n-3  3 n 9n  2 

V n 


I 


(1)  _ 4 (2)  k2  i + 

n-4  " 3 H 27  n+0  1 

V n y 


(4.6.19) 

Together  with  (4.6.14)  we  arrive  at 

(4.6.20) 


&(D  - £ i r n 4 ,.(2)^  1+  J\ 

n-3  - 5 n 27  + 3 Hn  j n + °l  ^2 


From  (4.5.1)  and  ( U . 5 • 5)  we  see 


i = £ s<r> 

n-l  - n-2 

r =0 


sc  that  from  (4.6.10),  ( 1+  .6 . 6)  and  (4.6.7)  we  see 


(>•.6.21)  £ S irl  . 1 - -i,  - S<°>  . s'H  - s'2> 

r _ z,  n-2  n-l  n-2  n-2  n-2 


Vn  j 


proving 


(4.6.22) 


cn-2  - 1 

(n_1)  n 


• ( n) 

leading  to  the  value  for  £ stated  in  Proposition  4.1.1. 
From  (4.6.15)  we  see  that 

« ■ -m 


and  from  (4.6.21)  we  see 


£ 6 

r =2 


(r) 

n-3 


so  that 


0 h 


* 6<°)  ♦ . o[  5} 


°n'}  " (n-l)5  ' ’n-}  ' ”n-2 


Referring  to  (It. 6. 16)  and  (1.6.20)  we  have  then  proven  the  value  of  Tt  n'! 
in  Proposition  4.1.1. 
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CIOJ 


I 


5.  Measures  of  Efficiency  in  S' 


5.1  General  Formulae  for  Basic  Probabilities. 
In  order  to  obtain  the  measures  for  s{n^ 


* 

L = L 


--  the  expected  left  path  length 


— the  expected  number  of  key  comparisons 


* 

R = R 


--  the  expected  right  path  length 


RL  = RL  ^ — the  expected  length  of  the  last  right  subtree 

S1 


* 

C = C 


— the  expected  recursion  depth 


(see  Section  2.4)  we  need  knowledge  of  some  properties  of  the  CLPP  of  s|r 

(n),  * V _(n)  a b 

A)  (z,w)  = 2j  2j  T]v  ' z w 

1 a = 2 b=l  a,b 

f n) 

Formula  (J.4.1),  together  with  the  approximate  values  for  Tp  ^ proven 

a,  l 

in  Chapter  4 could  give  us  values  of  for  general  1 < b < a < n . 

SLj  d 

However,  it  turns  out  that  we  may  express  all  the  quantities  needed  in 
terms  of  's,  without  knowing  the  T)^n^  's  in  general. 

Blj  X <Xf  u 

To  establish  the  measures  above  we  need  formulae  for 

4 


(5.1.1) 

n-l-r  , x 

Xr  = b^1  Cb^b 

0 < r < n-2 

(5.1.2) 

a-l  , v 

^ ■ *ml  tl 

2 < a < n 
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(5-1. 3) 


t,  = 


Sn,  t> 


1 < b < n-1 


Knowing  that 
(5.1.4) 


(5-1-6) 


^“>(1,1)  = E \x 


r =0 


n 

E p, 

a = 2 


(s.i.s)  I4n,(2’l) 


3 A[n^(z,w) 
§z 


n-2 

E K zr 
r =0  r 


n 

E 


z=w=l  a=2 


a 


we  see  that  we  then  will  have  the  sufficient  knowledge  to  establish  the 
measures  needed  (see  Section  2.4). 

We  will  use  the  notation 


ea  “ 


(.-DnW  + (n-a)T,W;1 


(5-1-7) 

First  we  prove 

(5.1.8)  ^ i ^ - 21^-^)) 


1 < a < n . 


0 < r < n-2 


Using  w = l/z  in  (3-4.1)  we  obtain 


n a-1  , . . 

E E 7i(ni  ztt-b 

a =2  b = 1 a,b 


1 

n+1 


i Z1  2a-\b  T|<”)  - (a-b-l)^1  + (b-a)T|W(btl) 


a = 2 b =1 


si  af2  ■ 
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We  have 


(5.1.9)  H<a'1)(«,  i)  * * z [{m-iky  <2sa> 


Rearranging  the  general  equation  we  find 


if  (j+l)\.  zj(l-z)  = l-(n-l)T]^_  + E Pin)(l-za"1  + |l4a-1)(z  , |)) 


From  this  we  see 


‘ ‘ 'Vs"  a jj+3  ( F5  - i?i  k <°  < ■)  £ n-2> 


1 n 1 

when  regarding  the  coefficients  of  z,z  , ... ,z  . Summing  these  equations 

from  j = r to  j = n-2  eventually  proves  (5. 1.8). 

For  the  a 's  we  will  find 
a 


(5.1.10)  u 


1+  E f H.  +H.  , -H  , - ^ "] 

k = rfl  kV  k“r  k-1  r_1  k"1  J 


Putting  w = 1 in  (3.U.1)  we  have 


£ “a2*  - WX  2"  + SI  2 b 1$  - (*-b-l)’£i1  * 

a=2  a = 2 b = 1 


a = 2 


where 


, a-1  r-1 

= £ zr  £ 

r =2  s =1 


a-l-s)  a-s)  (r-l)r 


E zr 


C-i. 

V,  a-r  a-1  r y 


(2  < a) 


giving 


£ (^a(n+l-a)  -yj.a+1(n-a))za  = ^ + £ Hoa_1)(z,1)  • 


As  for  \ above  we  obtain  for  2 < r < n 
r — — 


<wi-^  = i\i 


leading  to  (5-1.10) 


The  t,  ’s  turn  out  to  be 
b 


(5.1.H) 


1 < b < n-1 


as  proven  by  isolating  the  terms  in  (3.4.1)  having  z to  the  power  (n-1) 


■u  T n-1 

b 1 v'  b 


^ Tbw°  = SI  b^1  "V  V (n-l-b)Vl)  + ^ wn-1(n-l)T1  + 


yielding 


Tn-1  = 1+  (n-l)r^ 


Tb(nfl-b)  - (n-l-b)Tbfl  = 0 


1 < b < n-2 


As  t = Tl^n|  = — — is  found  earlier  we  easily  see  (5. 1.11). 

n’  (n-1) 

We  will  also  prove  the  useful  relation 


(5-1.12) 


t/n)  - + 2n  T.  T!^nbH  _ 

*2,1  rH-1  + n+1  \l(Ha-2 


This  is  seen  from  (5. 1.10)  and  the  fact  that 


“2  * tl 
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We  find 


’ k = 3 

= 1 + Hk-2  ” 2 + ill)*  J,,  (n-k+1)i°i(  2Hk 

- 1+  2“  A \,!»k .s-D  + ad-^i) 

k — 3 

from  which  (5 .1.12)  follows  easily. 


5.2  The  Expected  Left  Path  Length 

The  fl  's  defii 

a 

Proposition  4.1.1  by 


The  p 's  defined  in  statement  (5 .1.7)  are  approximated  from 

3. 


(5.2.1) 


2 2 14 

P1  = 3 n - 3 Hn  + T + ° 


e2  = |n+§H.  + ^-  T h!2)  + 0| 


3 n 27  3 n 


H 

_r 

n 


et  ■ i °{ky 


(n) 

t 


3 < t < n . 


Inserting  (5.1. 10)  in  (5.1.4)  gives 


(5.2.2)  A<">(1,1)  - T,~ 


k-1 


1 +i-  1 


1+,  pkl  ^ t-a+1  ' t ~ k-1 

k = a+1  \t  = a 


= H . + N 
n-1 


where,  according  to  (5. 2.1) 
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Lata  k,a+l  t«'-W1  t k-lJn-l-U  MW  in  ) J 


Straightforward  calculations  lead  to 


(5-2.3) 


N = t «2  + t H - 

3 n 3 n n 


-i-ott 


and  hence  from  (2.4.6) 


(5.2.4) 


L*  = i H2  + \ H - 

3 n 3 n n 


}.Ji 

3 In 


The  expected  length  of  the  left  path  has  increased  from 

2 H - 1 
n 

in  the  normal  p-tree  forest  to  the  value  given  in  (5-2.4) 


5-3  The  Average  Number  of  Key  Comparisons. 

The  formula  for  the  expected  number  of  key  comparisons  in  the 
stationary  p-tree  forest  is  found  from  (2.4.10),  (5*1.6)  and  (5. 1.1)  to  be 


(5.3.1) 


i n n-2  , ... 

\?0  51  SFW  S 


where  S is  the  corresponding  value  for  the  normal  p-tree  forest, 

F0 

defined  in  (2.4.13). 

To  establish  a formula  for 


T = 1 + 


TW  r?2r,Ir 


we  see 
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T = 1 + 


r = 2 


1+A^n)(l,l)  -K 


where 


(5-3.2)  K = 


rn-1 


k-r 

k-1 


n+1  + rtf-1  k^5  Pk^k-2^(Hk-l  " 2 ) 


Defining 

(5-3-3)  U 


n-2 


^ St  s ^ \ 

k = 0 nfl  F^k  k 


we  have  from  (5-3-1)  and  (2.U.6) 

(5-3-1*)  S*  = L*  - K + U 

To  evaluate  U we  use  (5. 1.8)  and  (2.U.13) 


u = snki 


\ £ a*  X ( - 2<Vi  - w ) ( \ <4*  - ■*£ > « 


k = l 


r =2 


The  latter  inner  sum  simplifies  nicely  and  we  obtain  eventually 


U 


' *1  kl  * »1  pk(  k Hk-1  " 2 k ‘ 5 + ra  ) 

+ 2l.  Jta 


f,  ST'<kBk-l-ik-5+kil*tT-2(»k-l-H2» 


k .} 


from  (5.I.8)  with  r = 1 . 


Rearranging  the  terms  yields 


u - J,  51  <k-2>(  Hk-i  - 5 J 

Referring  to  (5.3.2),  we  see  that 


(5-3.5) 


U = K - 


n-1 

n+1 


and  hence,  by  insertion  in  (5*3.4) 


(5-3.6) 


* * n-l 

S = L - 

m-l 


Using  the  approximate  value  of  L from  the  previous  section  gives  us 

,'H2' 

Tocr  /n\  *7  1 

(5.3.7) 


S*  = 7 H2  + | H - - 2.  + o(  — J . 

3n3n  n 3 \ n 7 


The  expected  number  of  key  comparisons  is  hence  slightly  less  than  the 
expected  left  path  length,  and  has  the  same  dominating  term  as  the 
corresponding  quantity  of  the  normal  p-tree  forest,  being 


S - i h2  + H 

S - 3 Hn  + 9 n 


. i „(2)  . £8 
3 nn  27 


Formula  (5.3.6)  is  surprisingly  simple,  indicating  that  there  should  be 
an  easier  way  to  prove  it  than  the  one  we  have  been  using  here. 


5.U  The  Expected  Length  of  the  Right  Path. 

From  (2.4.14)  and  (5. 1.11)  we  find  the  expected  length  of  the  right 

A 

path  to  be 

(5,U,1)  R xl+k?oRFO0  (n-TRk?I)(k+2)  • 
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being 


In  [2]  is  quoted  the  recursion  formula  for  R 


(5-4.2) 


>>  = 1+  k^o 


From  these  two  equations  we  find 


(5-4.5) 


B*  * * (n)  + 33  (R  („)  - « - -^72  X « (k) 
F~  F (n-1)  k=0  Fv  ' 


From  (5-4.1)  we  find 


(S.i.i) 


1 , * . 
= — (R  ,.+  1 -R  ) 

n2  F(0n) 


R (n)  ls  to  be  a nondecr easing  sequence  of  positive  real  numbers, 


approaching  the  limit 


R = S 


J=o  ((j+i)i)‘ 


= 1.6261... 


(5.4.>)  and  (5*4.4)  show  that  the  R have  the  same  properties  as  R 


5-5  The  Expected  Length  of  the  Left  Path  of  the  Last  Right  Subtree. 

From  (2. It. 15)  and  (2.4.21)  we  find  the  expected  length  of  the  left 
path  of  the  last  right  subtree  in  the  stationary  p-tree  forest  to  be: 


k?'1  T|k+2,1(  2Hk-1) 


Referring  to  (5.1.12)  we  find 
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EL'  ■ 2 y + C 

K - j K = > 


(n) 


^ v-l  + (1-\i} 


(5.5.2) 


“■*  = 1 -H  ^-\i) 


Inserting  the  approximate  value  for  ^ in  Proposition  4.1.1  we 


find 

(5.5.3) 


* 7 H 

^ = 1 ‘ i + 0 T 

V n 


5.6  The  Expected  Recursion  Depth. 

Inserting  the  values  of  the  expected  recursion  depth  in  the  normal 
p-tree  forest: 


(5.6.1) 


,(n) 


2 H + ± 

3 Hntl  9 


(n  > 2) 


= C = 1 


P(0)  " %(1) 


in  (2.4.16),  yields 


(5-6.2) 


* 1 I n“2 

c - 1+  sfil  V2V£ 

K — c- 


(^ScCf  "l,l*  §) 


Using  (5-1.8)  the  latter  sum  becomes: 
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n-2 

s 

k =2 


(^Scd  Hkn*  |) 

6k(rf2  ( - ^-r«»i))(  I 

It  H,.  , + k-12  + T^-  j 


Hr+1  + 9 


n 

s 

k = 3 


pk^  " “k-1  ’ ' k- 


Again  using  (5. 1.8)  for  r = 0 and  1 we  see 


* 1 
c = 1 + ^7T  3c  + 


nfl  M2  rn-1 


Jr  £ 3(ritH  +k-12+T-dr  + - 

+ 1 , , k\^  k-1  k-1  k-1 


k = 3 


2Kk-i+2Hi 


+ XT  - 2Hk-l+2H2 


and  eventually 

(5.6.3) 


c ■ 1 + su  £ ■ 

k = 2 


Inserting  the  values  from  (5-2.1)  we  find: 


*2  1 I Hn 

C = |H  - 7 + 0 - 

3 n 6 V n 
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